FORM PTO-1390 U.S. DEPARTMENT OF COMMERCE PATENT AND TRADEMARK OFFICE
(REV 10-2000)

TRANSMITTAL LETTER TO THE UNITED STATES
DESIGNATED/ELECTED OFFICE (DO/EO/US)
CONCERNING A FILING UNDER 35 U.S.C. 371


ATTORNEY'S DOCKET NUMBER 

RII-003CPUS 


U.S. APPLICATION NO. (If known, see 37 CFR 1.5) 

09/719277


INTERNATIONAL APPLICATION 

PCT/US99/12929 


INTERNATIONAL FILING DATE 

09 June 1999 (09.06.99) 


PRIORITY DATE CLAIMED

09 June 1998 (09.06.98) 
11 June 1998 (11.06.98) 


TITLE OF INVENTION 

NOVEL HEPATITIS C VIRUS PEPTIDES AND USES THEREOF 


APPLICANT(S) FOR DO/EO/US 

Andrea B. BRANCH, Jose L. WALEWSKI, and Dechard D. STUMP 



Applicant herewith submits to the United States Designated/Elected Office (DO/EO/US) the following items and other information: 



1. ☒ This is a FIRST submission of items concerning a filing under 35 U.S.C. 371.
2. □ This is a SECOND or SUBSEQUENT submission of items concerning a filing under 35 U.S.C. 371.
3. □ This is an express request to promptly begin national examination procedures (35 U.S.C. 371(f)).
4. ☒ The US has been elected by the expiration of 19 months from the priority date (PCT Article 31).
5. ☒ A copy of the International Application as filed (35 U.S.C. 371(c)(2))
   a. □ is attached hereto (required only if not communicated by the International Bureau).
   b. has been communicated by the International Bureau.
   c. □ is not required, as the application was filed in the United States Receiving Office (RO/US).
6. □ An English language translation of the International Application as filed (35 U.S.C. 371(c)(2)).
7. ☒ Amendments to the claims of the International Application under PCT Article 19 (35 U.S.C. 371(c)(3))
   a. □ are attached hereto (required only if not communicated by the International Bureau).
   b. □ have been communicated by the International Bureau.
   c. □ have not been made; however, the time limit for making such amendments has NOT expired.
   d. ☒ have not been made and will not be made.
8. □ An English language translation of the amendments to the claims under PCT Article 19 (35 U.S.C. 371(c)(3)).
9. ☒ An oath or declaration of the inventor(s) (35 U.S.C. 371(c)(4)). (unexecuted) (4 sheets)
10. □ An English language translation of the annexes to the International Preliminary Examination Report under
    PCT Article 36 (35 U.S.C. 371(c)(5)).

Items 11. to 16. below concern document(s) or information included: 

11. □ An Information Disclosure Statement under 37 CFR 1.97 and 1.98.

12. □ An assignment document for recording. A separate cover sheet in compliance with 37 CFR 3.28 and 3.31 is
    included.

13. □ A FIRST preliminary amendment.

    □ A SECOND or SUBSEQUENT preliminary amendment.

14. □ A substitute specification.

15. IZI A change of power of attorney and/or address letter. 

16. ☒ Other items or information: Transmittal Letter (2 sheets in duplicate); PCT International Published
    Application (WO 99/63941) (without International Search Report) (51 sheets); PCT International
    Published Application Cover Sheet (WO 99/63941) (with International Search Report attached)
    (3 sheets); Sequence Listing (3 sheets); Diskette Containing Sequence Listing; Check ($520)
    based on small entity status; Certificate of Express Mailing (1 sheet); and Return Postcard.
U.S. APPLICATION NO. (if known, see 37 CFR 1.5): 09/719277
INTERNATIONAL APPLICATION NO.: PCT/US99/12929
ATTORNEY'S DOCKET NO.: RII-003CPUS

17. ☒ The following fees are submitted:                    CALCULATIONS   PTO USE ONLY

BASIC NATIONAL FEE (37 CFR 1.492(a)(1)-(5)) (a/o November 1, 2000):

  Neither international preliminary examination fee (37 CFR 1.482)
  nor international search fee (37 CFR 1.445(a)(2)) paid to USPTO
  and International Search Report not prepared by the EPO or JPO          $1000

  International preliminary examination fee (37 CFR 1.482) not paid to
  USPTO but International Search Report prepared by the EPO or JPO        $860

  International preliminary examination fee (37 CFR 1.482) not paid to USPTO but
  international search fee (37 CFR 1.445(a)(2)) paid to USPTO             $710

  International preliminary examination fee paid to USPTO (37 CFR 1.482)
  but all claims did not satisfy provisions of PCT Article 33(1)-(4)      $690

  International preliminary examination fee paid to USPTO (37 CFR 1.482)
  and all claims satisfied provisions of PCT Article 33(1)-(4)            $100

ENTER APPROPRIATE BASIC FEE AMOUNT =                                      $860

Surcharge of $130.00 for furnishing the oath or declaration later than
□ 20 □ 30 months from the earliest claimed priority date (37 CFR 1.492(e)).   $--

CLAIMS                NUMBER FILED    NUMBER EXTRA    RATE
Total claims          30 - 20 =       10              X $18.00            $180
Independent claims       -  3 =       0               X $80.00            $0
MULTIPLE DEPENDENT CLAIM(S) (if applicable)           + 270.00

TOTAL OF ABOVE CALCULATIONS =                                             $1040

Applicant claims small entity status. See 37 CFR 1.27. The fees indicated above
are reduced by 1/2.                                                       $520

SUBTOTAL =                                                                $520

Processing fee of $130.00 for furnishing the English translation later than
□ 20 □ 30 months from the earliest claimed priority date (37 CFR 1.492(f)).   +

TOTAL NATIONAL FEE =                                                      $520

Fee for recording the enclosed assignment (37 CFR 1.21(h)). The assignment must be
accompanied by an appropriate cover sheet (37 CFR 3.28, 3.31). $40.00 per property   +   $--

TOTAL FEES ENCLOSED =                                                     $520
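
The arithmetic behind the fee entries above can be checked with a short sketch. It is an illustration only, assuming the amounts and rates exactly as they appear on this form; the constant and function names are hypothetical and are not part of the filing.

```python
# Fee arithmetic for the entries on the transmittal form (amounts in whole dollars).
# All names below are illustrative; the figures are the ones printed on the form.
BASIC_FEE = 860                   # basic national fee entered on the form
TOTAL_CLAIMS_FILED = 30           # "NUMBER FILED" for total claims
FREE_TOTAL_CLAIMS = 20            # claims covered by the basic fee
RATE_PER_EXTRA_CLAIM = 18         # dollars per total claim over 20
EXTRA_INDEPENDENT_CLAIMS = 0      # "NUMBER EXTRA" entered for independent claims
RATE_PER_EXTRA_INDEPENDENT = 80   # dollars per independent claim over 3


def national_fee(small_entity: bool = True) -> tuple[int, int]:
    """Return (total of calculations, amount due after any small-entity reduction)."""
    extra_claims = max(TOTAL_CLAIMS_FILED - FREE_TOTAL_CLAIMS, 0)
    total = (
        BASIC_FEE
        + extra_claims * RATE_PER_EXTRA_CLAIM
        + EXTRA_INDEPENDENT_CLAIMS * RATE_PER_EXTRA_INDEPENDENT
    )
    # Small entity status halves the fees indicated above (37 CFR 1.27).
    due = total // 2 if small_entity else total
    return total, due


print(national_fee())  # (1040, 520)
```

The result agrees with the form: $860 + 10 x $18 = $1040, reduced by half to the $520 check enclosed.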



Amount to be refunded:
Amount to be charged:

a. A check in the amount of $520.00 to cover the above fees is enclosed.

b. □ Please charge my Deposit Account No.            in the amount of $
   to cover the above fees. A duplicate copy of this sheet is enclosed.

c. ☒ The Commissioner is hereby authorized to charge any additional fees which may be required, or credit
   any overpayment to Deposit Account No. 12-0080. A duplicate copy of this sheet is enclosed.



NOTE: Where an appropriate time limit under 37 CFR 1.494 or 1.495 has not been met, a petition to revive (37 CFR 
1.137(a) or (b)) must be filed and granted to restore the application to pending status. 



SEND ALL CORRESPONDENCE TO: 

Giulio A. DeConti, Esq. 
LAHIVE & COCKFIELD, LLP 
28 State Street 
Boston, Massachusetts 02109 
United States of America 
Date: 08 December 2000 



SIGNATURE

Megan E. Williams
NAME

43,270
REGISTRATION NUMBER
IN THE UNITED STATES PATENT DESIGNATED OFFICE (DO/US)
(National Phase of International App.: PCT/US99/12929) 

In re the 

application of: Andrea D. BRANCH et al. 

International Application No.: PCT/US99/12929 
International Filing Date: 09 JUNE 1999 
U.S. Serial No.: Not yet assigned 
Filed: Herewith 

For: NOVEL HEPATITIS C VIRUS PEPTIDES 
AND USES THEREOF 

Attorney Docket No.: RII-003CPUS 



Assistant Commissioner for Patents 
BOX PCT 

Washington, D.C. 20231 

Certification Under 37 CFR 1.10
I hereby certify that the attached: 

Transmittal Letter (2 sheets in duplicate); Declaration, Petition and Power of 
Attorney (unexecuted) (4 sheets); PCT International Published Application 
(WO 99/63941) (without International Search Report) (51 sheets); Cover sheet of 
PCT International Published Application (WO 99/63941) (with International 
Search Report attached) (3 sheets); Sequence Listing (3 sheets); Diskette 
Containing Sequence Listing; Check ($520) based on small entity; Certificate of 
Express Mailing (1 sheet); and Return Postcard are being deposited by me with the 
United States Postal Service "Express Mail Post Office to Addressee" service, 
Mailing Label No. EL 589 754 944 US, under 37 CFR 1.10 on the date indicated 
below and is addressed to the Assistant Commissioner for Patents, Box PCT, 
Washington, D.C. 20231. 



Date: 08 December 2000 



Name: Nelson F. Barros Signature:



RII-003CPPC



09/719277 

JC01 Rec'd PCT/PTO 08 DEC



SEQUENCE LISTING

<110> Andrea Branch, Jose Walewski, and Dechard Stump

<120> NOVEL HEPATITIS C VIRUS PEPTIDES AND USES THEREOF

<130> RII-003PC

<140>
<141>

<160> 8

<170> PatentIn Ver. 2.0

<210> 1
<211> 595
<212> DNA
<213> Hepatitis C virus

<400> 1
gcacgaatcc taaacctcaa agaaaaacca aacgtaacac caaccgtcgc ccacaggacg 60
tcaagttccc gggtggcggt cagatcgttg gtggagttta cttgttgccg cgcaggggcc 120
ctagattggg tgtgcgcgcg acgaggaaga cttccgagcg gtcgcaacct cgaggtagac 180
gtcagcctat ccccaaggca cgtcggcccg agggcaggac ctgggctcag cccgggtacc 240
cttggcccct ctatggcaat gagggttgcg ggtgggcggg atggctcctg tctccccgtg 300
gctctcggcc tagctggggc cccacagacc cccggcgtag gtcgcgcaat ttgggtaagg 360
tcatcgatac ccttacgtgc ggcttcgccg acctcatggg gtacataccg ctcgtcggcg 420
cccctcttgg aggcgctgcc agggccctgg cgcatggcgt ccgggttctg gaagacggcg 480
tgaactatgc aacagggaac cttcctggtt gctctttctc tatcttcctt ctggccctgc 540
tctcttgcct gactgtgccc gcttcagcct accaagtgcg caattcctcg gggct 595

<210> 2
<211> 196
<212> PRT
<213> Hepatitis C virus

<400> 2
Ala Arg Ile Leu Asn Leu Lys Glu Lys Pro Asn Val Thr Pro Thr Val
1 5 10 15

Ala His Arg Thr Ser Ser Ser Arg Val Ala Val Arg Ser Leu Val Glu
20 25 30

Phe Thr Cys Cys Arg Ala Gly Ala Leu Asp Trp Val Cys Ala Arg Arg
35 40 45

Gly Arg Leu Pro Ser Gly Arg Asn Leu Glu Val Asp Val Ser Leu Ser
50 55 60

Pro Arg His Val Gly Pro Arg Ala Gly Pro Gly Leu Ser Pro Gly Thr
65 70 75 80

Leu Gly Pro Ser Met Ala Met Arg Val Ala Gly Gly Arg Asp Gly Ser
85 90 95

Cys Leu Pro Val Ala Leu Gly Leu Ala Gly Ala Pro Gln Thr Pro Gly
100 105 110

Val Gly Arg Ala Ile Trp Val Arg Ser Ser Ile Pro Leu Arg Ala Ala
115 120 125

Ser Pro Thr Ser Trp Gly Thr Tyr Arg Ser Ser Ala Pro Leu Leu Glu
130 135 140

Ala Ala Pro Gly Pro Trp Arg Met Ala Ser Gly Phe Trp Lys Thr Ala
145 150 155 160

Thr Met Gln Gln Gly Thr Phe Leu Val Ala Leu Ser Leu Ser Ser Phe
165 170 175

Trp Pro Cys Ser Leu Ala Leu Cys Pro Leu Gln Pro Thr Lys Cys Ala
180 185 190

Ile Pro Arg Gly
195



<210> 3
<211> 13
<212> PRT
<213> Hepatitis C virus

<220>
<223> at location 8, X is N or K

<220>
<223> at location 9, X is V or E

<220>
<223> at location 13, X is A or V

<400> 3
Leu Asn Leu Lys Glu Lys Pro Xaa Xaa Thr Pro Thr Xaa
1 5 10

<210> 4
<211> 13
<212> PRT
<213> Hepatitis C virus

<220>
<223> at location 6, X is L or S

<220>
<223> at location 10, X is A or V

<220>
<223> at location 11, X is A or V

<400> 4
Ala Ala His Arg Thr Xaa Ser Ser Arg Xaa Xaa Val Arg
1 5 10

<210> 5
<211> 14
<212> PRT
<213> Hepatitis C virus

<400> 5
Leu Asn Leu Lys Glu Lys Pro Asn Val Thr Pro Thr Ala Cys
1 5 10

<210> 6
<211> 14
<212> PRT
<213> Hepatitis C virus

<400> 6
Ala Ala His Arg Thr Ser Ser Ser Arg Ala Val Val Arg Cys
1 5 10

<210> 7
<211> 18
<212> PRT
<213> Hepatitis C virus

<400> 7
Pro Thr Asp Pro Arg Arg Arg Ser Arg Asn Leu Gly Lys Val Ile Asp
1 5 10 15

Thr Cys

<210> 8
<211> 20
<212> PRT
<213> Hepatitis C virus

<400> 8
Gly Cys Ala Thr Arg Lys Thr Ser Glu Arg Ser Gln Pro Arg Gly Arg
1 5 10 15

Arg Ala Pro Ile
20


CRF Errors Corrected By the STIC Systems Branch

Serial Number: 09/719,277

[ ] Changed a file from non-ASCII to ASCII.

[ ] Changed the margins in cases where the sequence text was "wrapped" down to the next line.

[ ] Edited a format error in the Current Application Data section, specifically:

[ ] Edited the Current Application Data section with the actual current number. The number inputted by the
    applicant was [ ] the prior application data; or [ ] other.

[ ] Added the mandatory heading and subheadings for "Current Application Data".

[ ] Edited the "Number of Sequences" field. The applicant spelled out a number instead of using an integer.

[ ] Changed the spelling of a mandatory field (the headings or subheadings), specifically:

[ ] Corrected the SEQ ID NO when obviously incorrect. The sequence numbers that were edited were:

[ ] Inserted or corrected a nucleic number at the end of a nucleic line. SEQ ID NO's edited:

[ ] Corrected subheading placement. All responses must be on the same line as each subheading. If the
    applicant placed a response below the subheading, this was moved to its appropriate place.

[ ] Inserted colons after headings/subheadings. Headings edited included:

[ ] Deleted extra, invalid, headings used by an applicant, specifically:

[ ] Deleted: [ ] non-ASCII "garbage" at the beginning/end of files; [ ] secretary initials/filename at end of file;
    [ ] page numbers throughout text; [ ] other invalid text.

[ ] Inserted mandatory headings, specifically:

[ ] Corrected an obvious error in the response, specifically:

[ ] Edited identifiers where upper case is used but lower case is required, or vice versa.

[ ] Corrected an error in the Number of Sequences field, specifically:

[ ] A "Hard Page Break" code was inserted by the applicant. All occurrences had to be deleted.

[ ] Deleted ending stop codon in amino acid sequences and adjusted the "(A)Length:" field accordingly (error
    due to a PatentIn bug). Sequences corrected:

[ ] Other:

Examiner: The above corrections must be communicated to the applicant in the first Office
Action. DO NOT send a copy of this form.
RAW SEQUENCE LISTING DATE: 02/06/2001
PATENT APPLICATION: US/09/719,277 TIME: 10:28:41

Input Set : A:\Pto.amc
Output Set: N:\CRF3\02062001\I719277.raw

3 <110> APPLICANT: Andrea Branch, Jose Walewski, and Dechard Stump
5 <120> TITLE OF INVENTION: NOVEL HEPATITIS C VIRUS PEPTIDES AND USES THEREOF
8 <130> FILE REFERENCE: RII-003PC
C--> 10 <140> CURRENT APPLICATION NUMBER: US/09/719,277
C--> 11 <141> CURRENT FILING DATE: 2000-12-08
13 <160> NUMBER OF SEQ ID NOS: 8
15 <170> SOFTWARE: PatentIn Ver. 2.0
17 <210> SEQ ID NO: 1
18 <211> LENGTH: 595
19 <212> TYPE: DNA
20 <213> ORGANISM: Hepatitis C virus
22 <400> SEQUENCE: 1
23 gcacgaatcc taaacctcaa agaaaaacca aacgtaacac caaccgtcgc ccacaggacg 60
24 tcaagttccc gggtggcggt cagatcgttg gtggagttta cttgttgccg cgcaggggcc 120
25 ctagattggg tgtgcgcgcg acgaggaaga cttccgagcg gtcgcaacct cgaggtagac 180
26 gtcagcctat ccccaaggca cgtcggcccg agggcaggac ctgggctcag cccgggtacc 240
27 cttggcccct ctatggcaat gagggttgcg ggtgggcggg atggctcctg tctccccgtg 300
28 gctctcggcc tagctggggc cccacagacc cccggcgtag gtcgcgcaat ttgggtaagg 360
29 tcatcgatac ccttacgtgc ggcttcgccg acctcatggg gtacataccg ctcgtcggcg 420
30 cccctcttgg aggcgctgcc agggccctgg cgcatggcgt ccgggttctg gaagacggcg 480
31 tgaactatgc aacagggaac cttcctggtt gctctttctc tatcttcctt ctggccctgc 540
32 tctcttgcct gactgtgccc gcttcagcct accaagtgcg caattcctcg gggct 595
34 <210> SEQ ID NO: 2
35 <211> LENGTH: 196
36 <212> TYPE: PRT
37 <213> ORGANISM: Hepatitis C virus
39 <400> SEQUENCE: 2
40 Ala Arg Ile Leu Asn Leu Lys Glu Lys Pro Asn Val Thr Pro Thr Val
41 1 5 10 15
43 Ala His Arg Thr Ser Ser Ser Arg Val Ala Val Arg Ser Leu Val Glu
44 20 25 30
46 Phe Thr Cys Cys Arg Ala Gly Ala Leu Asp Trp Val Cys Ala Arg Arg
47 35 40 45
49 Gly Arg Leu Pro Ser Gly Arg Asn Leu Glu Val Asp Val Ser Leu Ser
50 50 55 60
52 Pro Arg His Val Gly Pro Arg Ala Gly Pro Gly Leu Ser Pro Gly Thr
53 65 70 75 80
55 Leu Gly Pro Ser Met Ala Met Arg Val Ala Gly Gly Arg Asp Gly Ser
56 85 90 95
59 Cys Leu Pro Val Ala Leu Gly Leu Ala Gly Ala Pro Gln Thr Pro Gly
60 100 105 110
62 Val Gly Arg Ala Ile Trp Val Arg Ser Ser Ile Pro Leu Arg Ala Ala
63 115 120 125
65 Ser Pro Thr Ser Trp Gly Thr Tyr Arg Ser Ser Ala Pro Leu Leu Glu
66 130 135 140
68 Ala Ala Pro Gly Pro Trp Arg Met Ala Ser Gly Phe Trp Lys Thr Ala
69 145 150 155 160




71 Thr Met Gln Gln Gly Thr Phe Leu Val Ala Leu Ser Leu Ser Ser Phe
72 165 170 175
74 Trp Pro Cys Ser Leu Ala Leu Cys Pro Leu Gln Pro Thr Lys Cys Ala
75 180 185 190
77 Ile Pro Arg Gly
78 195
81 <210> SEQ ID NO: 3
82 <211> LENGTH: 13
83 <212> TYPE: PRT
84 <213> ORGANISM: Hepatitis C virus
86 <220> FEATURE:
87 <223> OTHER INFORMATION: at location 8, X is N or K
89 <220> FEATURE:
90 <223> OTHER INFORMATION: at location 9, X is V or E
92 <220> FEATURE:
93 <223> OTHER INFORMATION: at location 13, X is A or V
95 <400> SEQUENCE: 3
96 Leu Asn Leu Lys Glu Lys Pro Xaa Xaa Thr Pro Thr Xaa

97 1 5 10
100 <210> SEQ ID NO: 4
101 <211> LENGTH: 13
102 <212> TYPE: PRT
103 <213> ORGANISM: Hepatitis C virus
105 <220> FEATURE:
106 <223> OTHER INFORMATION: at location 6, X is L or S
108 <220> FEATURE:
109 <223> OTHER INFORMATION: at location 10, X is A or V
111 <220> FEATURE:
112 <223> OTHER INFORMATION: at location 11, X is A or V
114 <400> SEQUENCE: 4
115 Ala Ala His Arg Thr Xaa Ser Ser Arg Xaa Xaa Val Arg
116 1 5 10
119 <210> SEQ ID NO: 5
120 <211> LENGTH: 14
121 <212> TYPE: PRT
122 <213> ORGANISM: Hepatitis C virus
124 <400> SEQUENCE: 5
125 Leu Asn Leu Lys Glu Lys Pro Asn Val Thr Pro Thr Ala Cys
126 1 5 10
129 <210> SEQ ID NO: 6
130 <211> LENGTH: 14
131 <212> TYPE: PRT
132 <213> ORGANISM: Hepatitis C virus
134 <400> SEQUENCE: 6
135 Ala Ala His Arg Thr Ser Ser Ser Arg Ala Val Val Arg Cys
136 1 5 10
139 <210> SEQ ID NO: 7
140 <211> LENGTH: 18
141 <212> TYPE: PRT

142 <213> ORGANISM: Hepatitis C virus
144 <400> SEQUENCE: 7
145 Pro Thr Asp Pro Arg Arg Arg Ser Arg Asn Leu Gly Lys Val Ile Asp
146 1 5 10 15
148 Thr Cys
152 <210> SEQ ID NO: 8
153 <211> LENGTH: 20
154 <212> TYPE: PRT
155 <213> ORGANISM: Hepatitis C virus
157 <400> SEQUENCE: 8
158 Gly Cys Ala Thr Arg Lys Thr Ser Glu Arg Ser Gln Pro Arg Gly Arg
159 1 5 10 15
161 Arg Ala Pro Ile
162 20
VERIFICATION SUMMARY DATE: 02/06/2001
PATENT APPLICATION: US/09/719,277 TIME: 10:28:42

Input Set : A:\Pto.amc
Output Set: N:\CRF3\02062001\I719277.raw

L:10 M:270 C: Current Application Number differs, Replaced Application Number
L:11 M:271 C: Current Filing Date differs, Replaced Current Filing Date
L:96 M:258 W: Mandatory Feature missing, <221> not found for SEQ ID#:3
L:96 M:258 W: Mandatory Feature missing, <222> not found for SEQ ID#:3
L:96 M:340 W: (46) "n" or "Xaa" used: Feature required, for SEQ ID#:3
L:115 M:258 W: Mandatory Feature missing, <221> not found for SEQ ID#:4
L:115 M:258 W: Mandatory Feature missing, <222> not found for SEQ ID#:4
L:115 M:340 W: (46) "n" or "Xaa" used: Feature required, for SEQ ID#:4
PCT

RAW SEQUENCE LISTING DATE: 01/30/2001
PATENT APPLICATION: US/09/719,277 TIME: 09:22:53

Input Set : A:\seqlist.txt
Output Set: N:\CRF3\01302001\I719277.raw

Does Not Comply

3 <110> APPLICANT: Andrea Branch, Jose Walewski, and Dechard Stump
5 <120> TITLE OF INVENTION: NOVEL HEPATITIS C VIRUS PEPTIDES AND USES THEREOF
8 <130> FILE REFERENCE: RII-003PC
C--> 10 <140> CURRENT APPLICATION NUMBER: US/09/719,277
C--> 11 <141> CURRENT FILING DATE: 2000-12-08
13 <160> NUMBER OF SEQ ID NOS: 8
15 <170> SOFTWARE: PatentIn Ver. 2.0

ERRORED SEQUENCES

152 <210> SEQ ID NO: 8
153 <211> LENGTH: 20
154 <212> TYPE: PRT
155 <213> ORGANISM: Hepatitis C virus
157 <400> SEQUENCE: 8
158 Gly Cys Ala Thr Arg Lys Thr Ser Glu Arg Ser Gln Pro Arg Gly Arg
159 1 5 10 15
161 Arg Ala Pro Ile
162 20
E--> 164 RII-003CPPC
E--> 167 - 3 -
VERIFICATION SUMMARY DATE: 01/30/2001
PATENT APPLICATION: US/09/719,277 TIME: 09:22:54

Input Set : A:\seqlist.txt
Output Set: N:\CRF3\01302001\I719277.raw

L:10 M:270 C: Current Application Number differs, Replaced Application Number
L:11 M:271 C: Current Filing Date differs, Replaced Current Filing Date
L:96 M:258 W: Mandatory Feature missing, <221> not found for SEQ ID#:3
L:96 M:258 W: Mandatory Feature missing, <222> not found for SEQ ID#:3
L:96 M:340 W: (46) "n" or "Xaa" used: Feature required, for SEQ ID#:3
L:115 M:258 W: Mandatory Feature missing, <221> not found for SEQ ID#:4
L:115 M:258 W: Mandatory Feature missing, <222> not found for SEQ ID#:4
L:115 M:340 W: (46) "n" or "Xaa" used: Feature required, for SEQ ID#:4
L:164 M:333 E: Wrong sequence grouping, Amino acids not in groups!
L:164 M:320 E: (1) Wrong Nucleic Acid Designator, NUMBER OF INVALID KEYS: 1
L:167 M:332 E: (32) Invalid/Missing Amino Acid Numbering, SEQ ID:8
L:167 M:252 E: No. of Seq. differs, <211>LENGTH: Input: 20 Found: 21 SEQ:8
Background of the Invention

Hepatitis C virus (HCV) is closely related to both the pestivirus and flavivirus
genera in the Flaviviridae family. HCV is a single stranded RNA virus; the viral
genome is approximately 9.5 kb. HCV RNA is positive sense and has a unique open
reading frame which encodes a single polyprotein (Clarke. 1997. J. Gen. Virol.
78:2397). The polyprotein is proteolytically processed to yield the mature viral proteins
which include: nucleocapsid, envelope 1, envelope 2, metalloprotease, serine protease,
RNA helicase, cofactor, and RNA polymerase.

HCV is a major human pathogen. The virus was found to be the cause of most
cases of hepatitis which could not be ascribed to hepatitis A, hepatitis B, or hepatitis
delta virus (Clarke, supra). Over fifty percent of patients with hepatitis C virus (HCV)
become chronic carriers of the virus; there may be as many as 500 million chronic
carriers worldwide (Dhillon and Dusheiko. 1995. Histopathology 26:297). Persistent
infection with the virus causes chronic hepatitis and may ultimately lead to cirrhosis
and/or cancer (Kuo et al. 1989. Science 244:362). Current therapies for HCV are
ineffective; consequently, there is a need for new approaches to treat HCV infection.

Summary 

The present invention is an important advance in the battle against hepatitis C.
The novel peptides of the invention, which are not encoded by the standard polyprotein
HCV reading frame, have been shown to elicit an immune response in patients infected
with HCV and, thus, are produced during HCV infection. Accordingly, the invention
provides novel HCV polypeptides which are not derived from the HCV polyprotein and
methods of their use.
In one aspect, the invention pertains to an isolated or recombinant polypeptide or
fragment thereof encoded by a nucleic acid molecule derived from a hepatitis C virus,
which polypeptide has at least one of the following characteristics:

1) at least a portion of the polypeptide is encoded by a reading frame +1 or +2
relative to the standard hepatitis C virus open reading frame;

2) at least a portion of the polypeptide is encoded by a reading frame
corresponding to the reading frame of SEQ ID NO:1 in which the first nucleotide of
SEQ ID NO:1 is the first nucleotide of a codon;

3) at least a portion of the polypeptide comprises an amino acid sequence at least
60% identical to the amino acid sequence shown in SEQ ID NO:2; and

4) at least a portion of the polypeptide comprises an amino acid sequence
encoded by a nucleic acid molecule which hybridizes under high stringency to the
nucleotide sequence shown in SEQ ID NO:1.
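
Characteristic 2) can be illustrated with a minimal sketch, assuming the standard genetic code and using only the first 48 nucleotides of SEQ ID NO:1 as given in the Sequence Listing. The function and variable names are hypothetical, chosen for illustration; reading from the first nucleotide (frame 0 below) gives the first sixteen residues of SEQ ID NO:2, while shifting the start by one nucleotide gives an unrelated product.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code; codons are ordered TTT, TTC, TTA, TTG, TCT, ..., GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str, frame: int = 0) -> str:
    """Translate a DNA/RNA string, starting `frame` nucleotides (0, 1 or 2) from its 5' end.

    Stop codons are reported as '*'; a trailing partial codon is ignored.
    """
    dna = seq.upper().replace("U", "T").replace(" ", "")
    codons = (dna[i:i + 3] for i in range(frame, len(dna) - 2, 3))
    return "".join(CODON_TABLE[codon] for codon in codons)


# First 48 nucleotides of SEQ ID NO:1, copied from the Sequence Listing.
SEQ1_START = "gcacgaatcctaaacctcaaagaaaaaccaaacgtaacaccaaccgtc"

print(translate(SEQ1_START))           # ARILNLKEKPNVTPTV
print(translate(SEQ1_START, frame=1))  # a different product, interrupted by stop codons
```

In three-letter code the first result reads Ala Arg Ile Leu Asn Leu Lys Glu Lys Pro Asn Val Thr Pro Thr Val, matching residues 1 to 16 of SEQ ID NO:2.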

In certain embodiments of the invention, the novel HCV polypeptides or portions
thereof of claim 1 are at least about 8 amino acids to at least about 100 amino acids in 
length. In other embodiments, the polypeptides or portions thereof are at least about 14 
amino acids to at least about 30 amino acids in length. 

In one embodiment, the novel HCV polypeptides or portions thereof are encoded
by a reading frame +1 or +2 to the standard hepatitis C reading frame. In preferred
embodiments, the polypeptides are encoded by a reading frame corresponding to the
reading frame of SEQ ID NO:1 in which the first nucleotide of SEQ ID NO:1 is the first
nucleotide of a codon. In other preferred embodiments, the polypeptides are encoded, in
part, by the nucleic acid molecule of SEQ ID NO:1 and cause an immune response in a
subject.

In other embodiments, the novel HCV polypeptides comprise an amino acid 
sequence at least 60% identical to the amino acid sequence shown in SEQ ID NO:2 and 
cause an immune response in a subject. In preferred embodiments, the novel HCV 
polypeptides comprise an amino acid sequence at least 90% identical to the amino acid 
sequence shown in SEQ ID NO:2 and cause an immune response in a subject. In other
preferred embodiments, the novel HCV polypeptides comprise an amino acid sequence 
shown in SEQ ID NO: 2 and cause an immune response in a subject. 



WO 99/63941 PCT/US99/12929

-3- 

In other embodiments, the novel HCV polypeptides comprise an amino acid
sequence encoded by a nucleic acid molecule which hybridizes under high stringency to
the nucleotide sequence shown in SEQ ID NO:1.

In other embodiments, the novel HCV polypeptides comprise at least a portion of 
an amino acid sequence selected from the group consisting of SEQ ID NO:3, SEQ ID
NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, and SEQ ID NO:8 and cause an 
immune response in a subject. 

In still other embodiments, the invention pertains to isolated or recombinant
polypeptides comprising an amino acid sequence selected from the group consisting of:
LNLKEKP(X1)(X2)TPT(X3) and AAHRT(X4)SSR(X5)(X6)VR, wherein X1 is N or K,
X2 is V or E, X3 is A or V, X4 is L or S, X5 is A or V, and X6 is A or V. In yet other
embodiments, the novel HCV polypeptides consist of an amino acid sequence selected
from the group consisting of LNLKEKPNVTPTAC and AAHRTSSSRAVVRC.
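
The two degenerate formulas above each stand for a small family of concrete sequences. The sketch below enumerates them; it is an illustration only, and the helper name `expand` and the brace-style placeholders are hypothetical conventions introduced here, not notation from the specification.

```python
from itertools import product


def expand(template: str, choices: dict[str, str]) -> list[str]:
    """Expand a peptide template with {X1}-style placeholders into every concrete sequence.

    `choices` maps each placeholder name to a string of allowed one-letter residues.
    """
    names = list(choices)
    return [
        template.format(**dict(zip(names, combo)))
        for combo in product(*(choices[name] for name in names))
    ]


# LNLKEKP(X1)(X2)TPT(X3): X1 is N or K, X2 is V or E, X3 is A or V.
FIRST_FAMILY = expand("LNLKEKP{X1}{X2}TPT{X3}", {"X1": "NK", "X2": "VE", "X3": "AV"})
# AAHRT(X4)SSR(X5)(X6)VR: X4 is L or S, X5 is A or V, X6 is A or V.
SECOND_FAMILY = expand("AAHRT{X4}SSR{X5}{X6}VR", {"X4": "LS", "X5": "AV", "X6": "AV"})

print(len(FIRST_FAMILY), len(SECOND_FAMILY))  # 8 8
print("LNLKEKPNVTPTA" in FIRST_FAMILY)        # True
print("AAHRTSSSRAVVR" in SECOND_FAMILY)       # True
```

Each formula covers 2 x 2 x 2 = 8 sequences. The two 14-residue peptides named at the end of the paragraph are one member of each family with a cysteine added at the carboxy terminus.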

In another aspect, the invention pertains to a vaccine composition for preventing 
hepatitis C infection in a subject. In one embodiment such a vaccine comprises a novel 
HCV polypeptide. In another embodiment, such a vaccine comprises a nucleic acid 
encoding a novel HCV polypeptide.

In another aspect, the invention pertains to an antibody which binds to a novel
HCV polypeptide. 

In yet another aspect the invention pertains to a kit for detecting a hepatitis C 
infection. In one embodiment, such a kit comprises a novel HCV polypeptide. In
another aspect, the kit comprises an antibody which binds to a novel HCV polypeptide. 

In yet another aspect, the invention pertains to a method of preventing HCV 
infection by administering a novel HCV polypeptide to a subject or by causing said 
polypeptide to be synthesized in a subject prior to HCV infection such that HCV
infection is prevented. 

The invention also pertains to methods of diagnosing HCV infection. In one 
embodiment, the method comprises detecting the presence or absence of antibodies 
which react with a novel HCV polypeptide in the body fluid of a subject, wherein the 
presence of antibodies which bind the polypeptide is indicative of an infection with 
HCV. In another embodiment, the method comprises detecting the presence or absence 
of a novel HCV polypeptide in the body fluid or tissue of a subject, wherein the presence
of an HCV polypeptide is indicative of an infection with HCV. 

In still another aspect, the invention pertains to a method for identifying a
compound which interacts with a novel HCV polypeptide by contacting the polypeptide
with a compound in a cell-free system under conditions which allow interaction of the
compound with the polypeptide such that a complex is formed; separating the
compounds which do not form complexes with an HCV polypeptide from those which
do form complexes with an HCV polypeptide; and isolating and identifying the
compounds which form complexes with an HCV polypeptide to identify a compound
which interacts with a novel HCV polypeptide.

Detailed Description 

The present invention is an important step forward in preventing Hepatitis C
(HCV) infection, in treating ongoing infection, and in improving existing diagnostic
techniques. The invention is based, in part, on the identification of novel polypeptides
encoded by the Hepatitis C viral genome. These novel polypeptides are not encoded by
the standard HCV polyprotein reading frame.

Before further description of the invention, certain terms employed in the
specification, examples and appended claims are, for convenience, collected here.


Definitions 

As used herein, the language "isolated or recombinant polypeptide" includes a
polypeptide which is substantially free of cellular material or culture medium when
produced by recombinant DNA techniques, or chemical precursors or other chemicals
when chemically synthesized.

As used herein, the term "polypeptide or fragment thereof" includes full-length
polypeptide molecules (from the first amino acid of translation initiation to the last
amino acid prior to translation termination) and peptide portions of such molecules.
Preferably the novel HCV polypeptides or fragments thereof are at least about 8 amino
acids to at least about 100 amino acids in length. More preferably the polypeptides or
fragments thereof are at least about 14 amino acids to at least about 30 amino acids in
length. In preferred embodiments, the novel HCV polypeptides of the invention
comprise an amino acid sequence which is conserved among different HCV isolates.
Such a conserved sequence can readily be determined using an alignment such as that
provided in Table 1. In other embodiments, the novel HCV polypeptides comprise a
portion of a novel HCV amino acid sequence which is distal (carboxy terminal) to a stop
codon in the +1 reading frame (relative to the main ORF) that includes the "UG" of the
"AUG" that is the initiator codon of the main ORF. In a more preferred embodiment,
the polypeptides or fragments thereof cause an immune response in a subject.

As used herein, the language "nucleic acid molecule" is intended to include DNA
molecules (e.g., cDNA or genomic DNA) and RNA molecules (e.g., genomic viral RNA
or mRNA). The nucleic acid molecule may be single-stranded or double-stranded.

As used herein, the language "the standard hepatitis C virus open reading frame"
is the open reading frame (ORF) of the viral RNA which encodes the well-known HCV
polyprotein. The standard ORF represents the largest ORF in the viral genome. In the
infectious clone (GenBank accession number AF011751) the standard ORF uses
nucleotide 342 as the first nucleotide of a codon and continues until nucleotide 9377. In
different HCV isolates, the nucleotide which is a first nucleotide of a codon of the
standard ORF may be at a slightly different position. The nucleotide which is a first
nucleotide of a codon for any isolate can easily be obtained to yield the standard HCV
ORF. For example, in the case of known isolates, GenBank (or another database
containing the nucleotide sequence information for the isolate) can be accessed and the
coding sequence (CDS) information can be obtained. Alternatively, to determine the
standard ORF of a known or a new isolate, the nucleic acid sequence of the known or
new isolate can be aligned with a known sequence to give the highest homology (e.g.,
using a program such as BLAST). An exemplary BLAST search can be done, e.g.,
using the sequence found in GenBank accession number AF011751 as the query
sequence. In this search, nucleotides 342-940 of AF011751 were used to search the
non-redundant sequence database. The ORF of other HCV isolates which corresponds to the
standard HCV ORF of AF011751 (in which the initiation codon is at position 342,
which is read as position 1 of the query sequence) can be read from the BLAST
alignment. For example, the corresponding first nucleotide of a codon for GenBank
accession no. HPCCGAA is 342. Another way to find the standard ORF would be to
use a program such as EditSeq (DNASTAR), which is designed to identify ORFs, using
the AUG aligned with position 342 of AF011751 as the start codon.

The term "percent (%) identity" as used in the context of nucleotide and amino
acid sequences (e.g., when one amino acid sequence is said to be X% identical to another
amino acid sequence) refers to the percentage of identical residues shared between the
two sequences, when optimally aligned. To determine the percent identity of two
nucleotide or amino acid sequences, the sequences are aligned for optimal comparison
purposes (e.g., gaps may be introduced in one sequence for optimal alignment with the
other sequence). The residues at corresponding positions are then compared and when a
position in one sequence is occupied by the same residue as the corresponding position
in the other sequence, then the molecules are identical at that position. The percent
identity between two sequences, therefore, is a function of the number of identical
positions shared by two sequences (i.e., % identity = (# of identical positions / total # of
positions) x 100).
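
The formula above can be illustrated with a minimal Python sketch. It assumes the two sequences have already been optimally aligned by some other program and are passed in as equal-length strings with "-" marking gaps. The function name and the choice to count gapped columns in the total number of positions are illustrative assumptions, not part of the specification.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity of two pre-aligned sequences of equal length."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    if not aligned_a:
        raise ValueError("alignment is empty")
    # A position counts as identical only when both sequences carry the
    # same residue there; a gap never counts as a match.
    identical = sum(
        1
        for x, y in zip(aligned_a.upper(), aligned_b.upper())
        if x == y and x != gap
    )
    # Assumption: "total # of positions" is the alignment length,
    # gapped columns included.
    return 100.0 * identical / len(aligned_a)
```

For example, two aligned four-residue sequences that differ at one position give 75% identity under these assumptions.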

Computer algorithms known in the art can be used to optimally align and
compare two nucleotide or amino acid sequences to define the percent identity between
the two sequences. A preferred, non-limiting example of a mathematical algorithm
utilized for the comparison of two sequences is the algorithm of Karlin and Altschul
(1990) Proc. Natl. Acad. Sci. USA 87:2264-68, modified as in Karlin and Altschul
(1993) Proc. Natl. Acad. Sci. USA 90:5873-77. Such an algorithm is incorporated into
the NBLAST and XBLAST programs of Altschul, et al. (1990) J. Mol. Biol. 215:403-10.
To obtain gapped alignments for comparison purposes, Gapped BLAST can be
utilized as described in Altschul et al., (1997) Nucleic Acids Research 25(17):3389-3402.
When utilizing BLAST and Gapped BLAST programs, the default parameters of
the respective programs (e.g., XBLAST and NBLAST) can be used. See
http://www.ncbi.nlm.nih.gov.

Another preferred, non-limiting example of a mathematical algorithm utilized for
the comparison of sequences is the algorithm of Myers and Miller, CABIOS (1989).
Such an algorithm is incorporated into the ALIGN program (version 2.0) which is part of
the GCG sequence alignment software package. When utilizing the ALIGN program for
comparing amino acid sequences, a PAM120 weight residue table, a gap length penalty
of 12, and a gap penalty of 4 can be used. If multiple programs are used to compare
sequences, the program that provides optimal alignment (i.e., the highest percent identity
between the two sequences) is used for comparison purposes.

As used herein, the language "+1 or +2 relative to the standard hepatitis C virus 
open reading frame" includes reading frames in which a first nucleotide of a codon is 
shifted +1 nucleotide relative to the standard ORF or +2 nucleotides relative to the
standard ORF. The reading frames encoding the novel polypeptides do not necessarily 
contain an in-frame start codon. 
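
As a hedged illustration of what reading a sequence in the standard, +1, or +2 frame means in practice, the following Python sketch translates a nucleotide string starting at a chosen offset from the first nucleotide of a standard-ORF codon. The function name, its parameters, and the use of the standard genetic code table are assumptions made for this example. Translation here simply proceeds codon by codon and does not require an in-frame start codon.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_frame(sequence: str, standard_start: int, shift: int = 0) -> str:
    """Translate `sequence` in the frame shifted by `shift` (0, 1 or 2).

    `standard_start` is the 1-based position of a first nucleotide of a
    codon in the standard ORF (a hypothetical parameter for this sketch).
    """
    dna = sequence.upper().replace("U", "T")  # accept RNA or DNA input
    first = standard_start - 1 + shift        # convert to a 0-based index
    codons = (dna[i:i + 3] for i in range(first, len(dna) - 2, 3))
    # Codons with ambiguous bases are shown as "X"; stop codons as "*".
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)
```

With the toy input "ATGGCCTAA" and a standard start at position 1, shift 0 reads ATG GCC TAA, shift 1 reads TGG CCT, and shift 2 reads GGC CTA.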

As used herein, the language "the reading frame of SEQ ID NO:1" means that the
first three nucleotides of the sequence shown in SEQ ID NO:1 are the first, second, and
third nucleotides of a codon for translation into an amino acid of a polypeptide. The
reading frame of SEQ ID NO:1 is +1 relative to the standard HCV ORF. The language
"a reading frame corresponding to the reading frame of SEQ ID NO:1" means that when
a sequence from an HCV isolate other than the AF011751 isolate shown in SEQ ID
NO:1 is aligned with the sequence of SEQ ID NO:1 to give the highest homology, e.g.,
using the BLAST program, it is then read in the same reading frame as SEQ ID NO:1 to
give the reading frame corresponding to the reading frame of SEQ ID NO:1. The
nucleotide position of a first nucleotide of a codon of an HCV isolate which corresponds
to that of SEQ ID NO:1 may vary from isolate to isolate. An exemplary BLAST search
illustrating this principle is provided as Appendix B. For example, for GenBank
accession number AF009606 a first nucleotide of a codon in a reading frame which
corresponds to the reading frame of SEQ ID NO:1 is nucleotide 346.

As used herein, the term "hybridizes under high stringency" is intended to
describe conditions for hybridization and washing under which nucleotide sequences at
least 70% homologous to each other typically remain hybridized to each other.
Preferably, the conditions are such that sequences at least 75%, 85%, or 95% identical
to each other typically remain hybridized to each other. Such stringent conditions are
known to those skilled in the art and can be found in Current Protocols in Molecular
Biology, John Wiley & Sons, N.Y. (1989), 6.3.1-6.3.6. A preferred, non-limiting
example of stringent hybridization conditions is hybridization in 6X sodium
chloride/sodium citrate (SSC) at about 45°C, followed by one or more washes in 0.2X
SSC, 0.1% SDS at 50-65°C.

As used herein, the term "antibody" is intended to include immunoglobulin
molecules and immunologically active portions of immunoglobulin molecules, i.e.,
molecules that contain an antigen binding site which specifically binds (immunoreacts
with) an antigen, such as Fab and F(ab')2 fragments. The terms "monoclonal antibodies"
and "monoclonal antibody composition", as used herein, refer to a population of
antibody molecules that contain only one species of an antigen binding site capable of
immunoreacting with a particular epitope of an antigen, whereas the terms "polyclonal
antibodies" and "polyclonal antibody composition" refer to a population of antibody
molecules that contain multiple species of antigen binding sites capable of interacting
with a particular antigen. A monoclonal antibody composition thus typically displays a
single binding affinity for a particular antigen with which it immunoreacts.

As used herein, the term "adjuvant" includes agents which potentiate the immune
response to an antigen. Adjuvants can be administered in conjunction with the subject
polypeptides to additionally augment the immune response.

As used herein, the term "enhancing an immune response" includes increasing T
and/or B cell responses, i.e., cellular and/or humoral immune responses, by treatment of
a subject using the claimed methods. In one embodiment, the claimed methods can be
used to enhance T helper cell responses. In another embodiment, the claimed methods
can be used to enhance cytotoxic T cell responses. The claimed methods can be used to
enhance both primary and secondary immune responses. Preferably, the immune
response is increased as compared to the response of immune cells to the antigen in the
absence of treatment with the claimed methods. The immune response of a subject can
be determined by, for example, assaying antibody production, immune cell proliferation,
the release of cytokines, the expression of cell surface markers, cytotoxicity, enhanced
ability to clear infection with HCV, etc.

II. Novel HCV Polypeptides

The novel HCV polypeptides of the invention are not derived from an HCV
polyprotein, i.e., the polypeptides of the present invention are not encoded by the
standard HCV ORF. These alternate reading frame polypeptides are translated from (or
synthesized based on) a reading frame which is +1 or +2 relative to the standard HCV
ORF. The position of the first nucleotide of an ORF in which these polypeptides are
translated will vary slightly depending upon the isolate studied. For example, for the
infectious clone (GenBank accession number AF011751) the first nucleotide of the ORF
in which the novel HCV polypeptides are translated is nucleotide 346, which is +5
relative to the standard HCV ORF. The first nucleotide of a codon of other, known or
new isolates which results in a reading frame which corresponds to the reading frame of
SEQ ID NO:1 can be determined, e.g., by performing a BLAST search using the nucleic
acid sequence of SEQ ID NO:1 as the query sequence as described above.
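
The relationship between a nucleotide position and its reading frame can be checked with simple arithmetic. The sketch below is illustrative only and its function names are hypothetical. It assumes the convention used in the text, in which the first nucleotide of the standard ORF (nucleotide 342 in AF011751) is counted as position +1, so that nucleotide 346 is position +5 and falls in the +1 frame.

```python
def relative_position(position: int, standard_start: int) -> int:
    """Position counted from the standard ORF start, which is taken as +1.

    Intended for positions at or after the standard start.
    """
    return position - standard_start + 1


def frame_shift(position: int, standard_start: int) -> int:
    """Frame (0, +1 or +2) of a codon whose first nucleotide is `position`,
    relative to the standard ORF beginning at `standard_start`."""
    # Python's % always returns a non-negative result for a positive
    # modulus, so positions 5' of the standard start are handled as well.
    return (position - standard_start) % 3
```

Under these assumptions, nucleotide 346 of AF011751 is at relative position +5 and in frame +1, consistent with the example above.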

Translation of the novel HCV polypeptides of the invention does not necessarily
have to begin at a start AUG codon. For example, previous work has shown that the
start AUG of HCV could be mutated to AUU or CUG with little effect on translation
efficiency (Clarke, supra). Alternatively, RNA editing may be involved in generating an
initiator codon. Translation of the novel HCV polypeptides may also begin at the
initiation site of the standard HCV ORF with a frame shift into a different reading frame.
Finally, translation may be initiated 5' of the AUG start codon of the standard ORF in
any of the three reading frames, but shifted into the +1 reading frame (relative to the
standard ORF) so as to yield production of peptides which are, in part, at least 60%
identical to a portion of SEQ ID NO:2. The internal ribosome entry site (IRES) is a
complex RNA structural element that includes part of the 5' untranslated region of HCV
RNA and part of the adjacent coding region. It may induce frame shifting or
translational bypassing.

In preferred embodiments, the polypeptides are encoded by a reading frame
corresponding to the reading frame of SEQ ID NO:1 in which the first nucleotide of
SEQ ID NO:1 is the first nucleotide of a codon. This reading frame can encode a
polypeptide of at least about 126 amino acids in length before a termination codon is
reached. Table 1 presents an alignment of novel HCV polypeptides which are encoded
in this reading frame from various HCV isolates, along with a majority sequence derived
using the Clustal method of sequence alignment. Stop codons appear in certain of the
isolates after amino acid 126. However, translation may proceed beyond these stop
codons. For example, in certain cases, these stop codons may be sequencing errors.
Alternatively, readthrough can occur by mutation, altered transcription, RNA editing,
frame shifting or ribosome slippage. Therefore, even in the polypeptides in which a stop
codon appears, in certain embodiments of the invention, the novel HCV polypeptide
may be longer, i.e., translation may proceed past a termination codon. Therefore, in the
case of, e.g., the infectious clone AF011751, translation of the polypeptide could
terminate, for example, at position 163 or 186 of SEQ ID NO:2. When the novel HCV
polypeptides of the invention are synthesized, these stop codons may be ignored.
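
To illustrate what a majority sequence is, the following Python sketch takes sequences that have already been aligned (for example by a Clustal-type program, which is not reimplemented here) and reports the most frequent residue in each column. The function name and the toy input are hypothetical, and the handling of ties is an arbitrary choice of this sketch rather than the behavior of any particular alignment package.

```python
from collections import Counter


def majority_sequence(aligned: list[str]) -> str:
    """Column-wise most frequent residue of equal-length aligned sequences."""
    if not aligned:
        raise ValueError("no sequences given")
    length = len(aligned[0])
    if any(len(seq) != length for seq in aligned):
        raise ValueError("aligned sequences must have the same length")
    majority = []
    for column in zip(*aligned):
        # most_common(1) returns the residue with the highest count;
        # how ties are broken is not meaningful in this sketch.
        residue, _count = Counter(column).most_common(1)[0]
        majority.append(residue)
    return "".join(majority)
```

For three toy aligned sequences that differ at single positions, the function returns the residue shared by two of the three at each such position.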

In other embodiments, the polypeptides of the invention have some percentage
identity to the sequence shown in SEQ ID NO:2. The percent identity between two
nucleic acid or amino acid sequences can easily be calculated by dividing the number of
identical bases or amino acids by the total number of bases or amino acids. Sequences
are aligned to give the highest percent identity and yet provide an alignment which is
biologically meaningful. Sequences can be aligned manually or, preferably, using an
algorithm. For example, in the case of amino acid sequences, a FASTA search can be
performed of the Swiss Protein database using the Blosum50.cmp scoring matrix. The
gap creation penalty can be set, e.g., at 12 and the extension penalty can be set, e.g., at 2.
The joining threshold can be set, e.g., at 36; the optimization threshold can be set, e.g., at
24; and the optimization width can be set, e.g., at 16.

In certain embodiments, the novel HCV polypeptides comprise an amino acid
sequence at least about 40-50% identical to the amino acid sequence shown in SEQ ID
NO:2. In preferred embodiments, the novel HCV polypeptides comprise an amino acid
sequence at least about 50-60% identical to the amino acid sequence shown in SEQ ID
NO:2. In other preferred embodiments, the novel HCV polypeptides comprise an amino
acid sequence at least about 60-70% identical to the amino acid sequence shown in SEQ
ID NO:2. In more preferred embodiments, the novel HCV polypeptides comprise an
amino acid sequence at least about 70-80% identical to the amino acid sequence shown
in SEQ ID NO:2. In still more preferred embodiments, the novel HCV polypeptides
comprise an amino acid sequence at least about 80-90% identical to the amino acid
sequence shown in SEQ ID NO:2. In another preferred embodiment, the novel HCV
polypeptides comprise an amino acid sequence shown in SEQ ID NO:2.

In preferred embodiments, the polypeptides of the invention have the described
percent identity over a length of at least about 10 amino acids. In more preferred
embodiments, the percent identity of the polypeptides extends over a length of at least
about 20-30 amino acids. In a more preferred embodiment, the percent identity of the
polypeptides extends over a length of at least about 30-40 amino acids. In a more
preferred embodiment, the percent identity of the polypeptides extends over a length of
at least about 40-50 amino acids. In another more preferred embodiment, the percent
identity of the polypeptides extends over a length of more than 50 amino acids. In other
preferred embodiments, the percent identity of the polypeptides extends over a length of
more than 100 amino acids.

In other embodiments, the novel HCV polypeptides comprise an amino acid
sequence encoded by a nucleic acid molecule having some percentage identity to the
nucleic acid molecule shown in SEQ ID NO:1. In certain embodiments, the novel HCV
polypeptides comprise an amino acid sequence encoded by a nucleic acid molecule at
least 70% identical to that shown in SEQ ID NO:1 in which the polypeptide is encoded
by the reading frame shown in SEQ ID NO:1. In preferred embodiments, the novel HCV
polypeptides comprise an amino acid sequence encoded by a nucleic acid molecule at
least 80% identical to that shown in SEQ ID NO:1 in which the polypeptide is encoded
by the reading frame shown in SEQ ID NO:1. In more preferred embodiments, the novel
HCV polypeptides comprise an amino acid sequence encoded by a nucleic acid molecule
at least 90% identical to that shown in SEQ ID NO:1 in which the polypeptide is encoded
by the reading frame shown in SEQ ID NO:1. In other embodiments, novel HCV
polypeptides comprise an amino acid sequence encoded by a nucleic acid molecule shown
in SEQ ID NO:1 in which the polypeptide is encoded by the reading frame shown in SEQ
ID NO:1.

In certain embodiments, the novel HCV polypeptides of the invention are
encoded by a nucleic acid molecule which hybridizes under stringent conditions to the
nucleic acid sequence shown in SEQ ID NO:1. Stringent hybridization conditions are
known in the art. In preferred embodiments, such polypeptides are encoded by a nucleic
acid molecule which hybridizes under stringent conditions to a nucleic acid molecule
from any HCV isolate, but which are read in or synthesized as if read in the reading
frame of SEQ ID NO:1.

In other embodiments, the novel HCV polypeptides comprise at least a portion of
an amino acid sequence selected from the group consisting of SEQ ID NO:3, SEQ ID
NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, and SEQ ID NO:8 and cause an
immune response in a subject. Other novel HCV polypeptides can be identified using an
HCV nucleic acid sequence and determining the amino acids which are encoded in the
+1 or +2 reading frame. Polypeptides comprising these sequences can be made and
assayed for reactivity with antibodies from infected subjects. Those polypeptides which
bind to antibodies, i.e., have elicited an immune response in infected subjects, are made
by the virus during the course of infection and represent preferred novel HCV
polypeptides.

In still other embodiments, the invention pertains to isolated or recombinant
polypeptides comprising an amino acid sequence selected from the group consisting of:
LNLKEKP(X1)(X2)TPT(X3) and AAHRT(X4)SSR(X5)(X6)VR, wherein X1 is N or K,
X2 is V or E, X3 is A or V, X4 is L or S, X5 is A or V, and X6 is A or V. In yet other
embodiments, the novel HCV polypeptides consist of an amino acid sequence selected
from the group consisting of LNLKEKPNVTPTAC and AAHRTSSSRAVVRC.
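
The two degenerate sequences above can be written as regular expressions, with each variable position expressed as a character class. The following Python sketch is illustrative only: the function name is hypothetical, and the check asks whether a peptide comprises either pattern anywhere in its sequence.

```python
import re

# Variable positions X1..X6 expressed as character classes:
# X1 = N or K, X2 = V or E, X3 = A or V, X4 = L or S, X5 = A or V, X6 = A or V.
MOTIFS = {
    "LNLKEKP(X1)(X2)TPT(X3)": re.compile(r"LNLKEKP[NK][VE]TPT[AV]"),
    "AAHRT(X4)SSR(X5)(X6)VR": re.compile(r"AAHRT[LS]SSR[AV][AV]VR"),
}


def matching_motifs(peptide: str) -> list[str]:
    """Return the names of the degenerate sequences found in `peptide`."""
    sequence = peptide.upper()
    return [name for name, pattern in MOTIFS.items() if pattern.search(sequence)]
```

Applied to the two 14-residue peptides named above, each matches exactly one of the two patterns.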

In certain embodiments of the invention the novel HCV polypeptides of the
invention are at least about 8 amino acids to at least about 100 amino acids in length. In
other embodiments, the polypeptides of the invention are at least about 10 amino acids
to at least about 50 amino acids in length. In other embodiments, the polypeptides of the
invention are at least about 14 to at least about 25 amino acids in length.

In preferred embodiments, the novel HCV polypeptides are of a length sufficient
to cause an immune response in a subject. Such an immune response can be measured
using techniques which are known in the art. For example, the immune response elicited
by the HCV polypeptides of the invention can be a T cell-mediated response which can
be measured by, e.g., cytokine production and/or cellular proliferation or cellular
cytotoxicity and/or a B cell mediated response which can be measured, e.g., by antibody
production.

In certain embodiments the novel HCV polypeptides of the invention are made as
fusion proteins. In addition to utilizing fusion proteins to enhance immunogenicity,
fusion proteins can also facilitate the expression of proteins, including the novel HCV
polypeptides of the present invention. For example, a novel HCV polypeptide can be
generated as a glutathione-S-transferase (GST) fusion protein. Such GST-fusion
proteins can enable easy purification of the novel HCV polypeptide, as for example by
the use of glutathione-derivatized matrices (see, for example, Current Protocols in
Molecular Biology, eds. Ausubel et al. (N.Y.: John Wiley & Sons, 1991)). In another
embodiment, a fusion gene coding for a purification leader sequence, such as a
poly(His)/enterokinase cleavage site sequence, can be fused to a novel HCV
polypeptide, in order to permit purification of the poly(His)-novel HCV polypeptide
by affinity chromatography using a Ni2+ metal resin. The purification leader sequence
can then be subsequently removed by treatment with enterokinase (e.g., see Hochuli et
al. (1987) J. Chromatography 411:177; and Janknecht et al. PNAS 88:8972).

Techniques for making fusion genes are known to those skilled in the art.
Essentially, the joining of various DNA fragments coding for different polypeptide
sequences is performed in accordance with conventional techniques, employing
blunt-ended or stagger-ended termini for ligation, restriction enzyme digestion to provide
for appropriate termini, filling-in of cohesive ends as appropriate, alkaline phosphatase
treatment to avoid undesirable joining, and enzymatic ligation. In another embodiment,
the fusion gene can be synthesized by conventional techniques including automated
DNA synthesizers. Alternatively, PCR amplification of gene fragments can be carried
out using anchor primers which give rise to complementary overhangs between two
consecutive gene fragments which can subsequently be annealed to generate a chimeric
gene sequence (see, for example, Current Protocols in Molecular Biology, eds. Ausubel
et al., John Wiley & Sons: 1992).

It will be understood that the preceding characteristics of HCV polypeptides are
not mutually exclusive.

III. Production of Novel HCV Polypeptides

Novel HCV polypeptides can be produced by recombinant DNA techniques. For
example, a nucleic acid molecule encoding such a polypeptide is cloned into an
expression vector, the expression vector is introduced into a host cell and the novel HCV
polypeptide is expressed in the host cell. The novel HCV polypeptide can then be
isolated from the cells by an appropriate purification scheme using standard protein
purification techniques. As an alternative to recombinant expression, a novel HCV
polypeptide can be synthesized chemically using standard peptide synthesis techniques
or purchased commercially. Moreover, native novel HCV polypeptides can be isolated
from cells (e.g., cultured human cells infected with HCV), for example using an
antibody.

A. Recombinant Production of Novel HCV Polypeptides 

In certain embodiments, the novel HCV polypeptides are encoded by a
naturally-occurring HCV nucleic acid molecule. As used herein, a "naturally-occurring"
nucleic acid molecule refers to an RNA molecule (or a DNA molecule derived therefrom)
having a nucleotide sequence that occurs in nature (e.g., encodes a protein produced by a
naturally occurring HCV isolate).

In addition to naturally-occurring isolates of the novel HCV polypeptides, the
skilled artisan will further appreciate that changes may be introduced by mutation, e.g.,
into an HCV nucleotide sequence thereby leading to changes in the amino acid sequence
of the encoded HCV polypeptides.

For example, an isolated nucleic acid molecule encoding a novel HCV
polypeptide homologous to the polypeptide of SEQ ID NO:2, i.e., having a certain
percentage identity to the polypeptide of SEQ ID NO:2, can be created by introducing
one or more nucleotide substitutions, additions or deletions into the nucleotide sequence
of SEQ ID NO:1 such that one or more amino acid substitutions, additions or deletions
are introduced into the encoded polypeptide. Mutations can be introduced into SEQ ID
NO:1 by standard techniques, such as site-directed mutagenesis and PCR-mediated
mutagenesis. Alternatively, such a polypeptide can be chemically synthesized to yield a
polypeptide with a change in amino acid sequence from that in the naturally occurring
polypeptide.

Preferably, no substitutions, or only conservative amino acid substitutions, are made
where there is high homology or identity in amino acid residues among the various
isolates. A "conservative amino acid substitution" is one in which the amino acid
residue is replaced with an amino acid residue having a similar side chain. Families of
amino acid residues having similar side chains have been defined in the art, including
basic side chains (e.g., lysine, arginine, histidine), acidic side chains (e.g., aspartic acid,
glutamic acid), uncharged polar side chains (e.g., glycine, asparagine, glutamine, serine,
threonine, tyrosine, cysteine), nonpolar side chains (e.g., alanine, valine, leucine,
isoleucine, proline, phenylalanine, methionine, tryptophan), beta-branched side chains
(e.g., threonine, valine, isoleucine) and aromatic side chains (e.g., tyrosine,
phenylalanine, tryptophan, histidine).
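
The side chain families listed above can be encoded as a simple lookup, using one-letter amino acid codes. The Python sketch below is a hedged illustration: the dictionary and function names are hypothetical, and treating a substitution as conservative whenever both residues share at least one listed family is an assumption of this example.

```python
# Side chain families as listed in the text, in one-letter amino acid codes.
SIDE_CHAIN_FAMILIES = {
    "basic": set("KRH"),               # lysine, arginine, histidine
    "acidic": set("DE"),               # aspartic acid, glutamic acid
    "uncharged polar": set("GNQSTYC"),
    "nonpolar": set("AVLIPFMW"),
    "beta-branched": set("TVI"),
    "aromatic": set("YFWH"),
}


def is_conservative(original: str, replacement: str) -> bool:
    """True if the two residues differ but share at least one family."""
    a, b = original.upper(), replacement.upper()
    if a == b:
        return False  # identical residue: not a substitution at all
    return any(a in family and b in family
               for family in SIDE_CHAIN_FAMILIES.values())
```

Under these assumptions, lysine to arginine counts as conservative (both basic), whereas aspartic acid to lysine does not.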

Alternatively, in another embodiment, mutations can be introduced randomly
along all or part of a novel HCV polypeptide coding sequence, such as by saturation
mutagenesis, and the resultant mutants can be screened, e.g., by testing for reactivity
with antibodies from an individual with a past or present HCV infection.

B. Expression Vectors and Host Cells

The nucleic acid molecules described herein can be expressed in an expression
vector to produce a novel HCV polypeptide. As used herein, the term "vector" refers to
a nucleic acid molecule capable of transporting another nucleic acid to which it has been
linked. One type of vector is a "plasmid", which refers to a circular double stranded
DNA loop into which additional DNA segments may be ligated. Another type of vector
is a viral vector, wherein additional DNA segments may be ligated into the viral
genome. Certain vectors are capable of autonomous replication in a host cell into which
they are introduced (e.g., bacterial vectors having a bacterial origin of replication and
episomal mammalian vectors). Other vectors (e.g., non-episomal mammalian vectors)
are integrated into the genome of a host cell upon introduction into the host cell, and
thereby are replicated along with the host genome. Moreover, certain vectors are
capable of directing the expression of genes to which they are operatively linked. Such
vectors are referred to herein as "expression vectors". In general, expression vectors of
utility in recombinant DNA techniques are often in the form of plasmids. In the present
specification, "plasmid" and "vector" may be used interchangeably as the plasmid is the
most commonly used form of vector. However, the invention is intended to include such
other forms of expression vectors, such as viral vectors (e.g., replication defective
retroviruses, adenoviruses and adeno-associated viruses), which serve equivalent
functions.

The recombinant expression vectors of the invention comprise a nucleic acid
molecule as described herein in a form suitable for expression of the nucleic acid in a
host cell, which means that the recombinant expression vectors include one or more
regulatory sequences, selected on the basis of the host cells to be used for expression,
which are operatively linked to the nucleic acid sequence to be expressed. Within a
recombinant expression vector, "operably linked" is intended to mean that the nucleotide
sequence of interest is linked to the regulatory sequence(s) in a manner which allows for
expression of the nucleotide sequence (e.g., in an in vitro transcription/translation system
or in a host cell when the vector is introduced into the host cell). The term "regulatory
sequence" is intended to include promoters, enhancers and other expression control
elements (e.g., polyadenylation signals). Such regulatory sequences are described, for
example, in Goeddel, Gene Expression Technology: Methods in Enzymology 185,
Academic Press, San Diego, CA (1990). Regulatory sequences include those which
direct constitutive expression of a nucleotide sequence in many types of host cell and
those which direct expression of the nucleotide sequence only in certain host cells (e.g.,
tissue-specific regulatory sequences). It will be appreciated by those skilled in the art
that the design of the expression vector may depend on such factors as the choice of the
host cell to be transformed, the level of expression of protein desired, etc. The
expression vectors of the invention can be introduced into host cells to thereby produce
novel HCV polypeptides, including fusion proteins comprising such polypeptides.

The recombinant expression vectors of the invention can be designed for
expression of novel HCV polypeptides in prokaryotic or eukaryotic cells. For example,
novel HCV polypeptides can be expressed in bacterial cells such as E. coli, insect cells
(using baculovirus expression vectors), yeast cells or mammalian cells. Suitable host
cells are discussed further in Goeddel, Gene Expression Technology: Methods in
Enzymology 185, Academic Press, San Diego, CA (1990). Alternatively, the
recombinant expression vector may be transcribed and translated in vitro, for example
using T7 promoter regulatory sequences and T7 polymerase.

Expression of proteins in prokaryotes is most often carried out in E. coli with
vectors containing constitutive or inducible promoters directing the expression of either
fusion or non-fusion proteins. Fusion vectors add a number of amino acids to a protein
encoded therein, usually to the amino terminus of the recombinant protein. Such fusion
vectors typically serve three purposes: 1) to increase expression of recombinant protein;
2) to increase the solubility of the recombinant protein; and 3) to aid in the purification
of the recombinant protein by acting as a ligand in affinity purification. Often, in fusion
expression vectors, a proteolytic cleavage site is introduced at the junction of the fusion
moiety and the recombinant protein to enable separation of the recombinant protein from
the fusion moiety subsequent to purification of the fusion protein. Such enzymes, and
their cognate recognition sequences, include Factor Xa, thrombin and enterokinase.
Typical fusion expression vectors include pGEX (Pharmacia Biotech Inc; Smith, D.B.
and Johnson, K.S. (1988) Gene 67:31-40), pMAL (New England Biolabs, Beverly, MA)
and pRIT5 (Pharmacia, Piscataway, NJ) which fuse glutathione S-transferase (GST),
maltose E binding protein, or protein A, respectively, to the target recombinant protein.

Examples of suitable inducible non-fusion E. coli expression vectors include
pTrc (Amann et al., (1988) Gene 69:301-315) and pET 11d (Studier et al., Gene
Expression Technology: Methods in Enzymology 185, Academic Press, San Diego,
California (1990) 60-89). Target gene expression from the pTrc vector relies on host
RNA polymerase transcription from a hybrid trp-lac fusion promoter. Target gene
expression from the pET 11d vector relies on transcription from a T7 gn10-lac fusion
promoter mediated by a coexpressed viral RNA polymerase (T7 gn1). This viral
polymerase is supplied by host strains BL21(DE3) or HMS174(DE3) from a resident λ
prophage harboring a T7 gn1 gene under the transcriptional control of the lacUV5
promoter.

One strategy to maximize recombinant protein expression in E. coli is to express
the protein in a host bacterium with an impaired capacity to proteolytically cleave the
recombinant protein (Gottesman, S., Gene Expression Technology: Methods in
Enzymology 185, Academic Press, San Diego, California (1990) 119-128). Another
strategy is to alter the nucleic acid sequence of the nucleic acid to be inserted into an
expression vector so that the individual codons for each amino acid are those
preferentially utilized in E. coli (Wada et al., (1992) Nuc. Acids Res. 20:2111-2118).
Such alteration of nucleic acid sequences of the invention can be carried out by standard
DNA synthesis techniques.
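
The codon-alteration strategy can be illustrated with a short Python sketch that
encodes a peptide using a single frequently used E. coli codon for each amino acid.
The codon table below is an assumption made for illustration (it is not taken from
this specification or from Wada et al.), as are the function names and the choice of
ATG and TAA as start and stop codons.

```python
# Assumed table of frequently used E. coli codons, one per amino acid (illustrative only).
PREFERRED_CODON = {
    "A": "GCG", "R": "CGT", "N": "AAC", "D": "GAT", "C": "TGC",
    "Q": "CAG", "E": "GAA", "G": "GGC", "H": "CAT", "I": "ATT",
    "L": "CTG", "K": "AAA", "M": "ATG", "F": "TTT", "P": "CCG",
    "S": "AGC", "T": "ACC", "W": "TGG", "Y": "TAT", "V": "GTG",
}


def back_translate(peptide, add_start=True, add_stop=True):
    """Encode `peptide` (one-letter code) using one preferred codon per residue."""
    codons = [PREFERRED_CODON[aa] for aa in peptide.upper()]
    if add_start:
        codons.insert(0, "ATG")   # assumed start codon
    if add_stop:
        codons.append("TAA")      # assumed stop codon
    return "".join(codons)


def translate(dna):
    """Translate DNA built from PREFERRED_CODON back to protein (round-trip check only)."""
    reverse = {codon: aa for aa, codon in PREFERRED_CODON.items()}
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    return "".join(reverse[c] for c in codons if c != "TAA")


peptide = "LNLKEKPNVTPTAC"
orf = back_translate(peptide)
print(orf)
print(translate(orf))
```

A practical design would usually weight codons by measured usage frequency and avoid
unwanted restriction sites or repeats; the one-codon-per-residue rule is used here only
to keep the example short.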

In another embodiment, the novel HCV polypeptides expression vector is a yeast
expression vector. Examples of vectors for expression in yeast S. cerevisiae include
pYepSec1 (Baldari et al., (1987) Embo J. 6:229-234), pMFa (Kurjan and Herskowitz,
(1982) Cell 30:933-943), pJRY88 (Schultz et al., (1987) Gene 54:113-123), and pYES2
(Invitrogen Corporation, San Diego, CA).

Alternatively, novel HCV polypeptides can be expressed in insect cells using
baculovirus expression vectors. Baculovirus vectors available for expression of proteins
in cultured insect cells (e.g., Sf 9 cells) include the pAc series (Smith et al., (1983) Mol.
Cell Biol. 3:2156-2165) and the pVL series (Lucklow, V.A., and Summers, M.D., (1989)
Virology 170:31-39).

In yet another embodiment, a nucleic acid molecule encoding novel HCV
polypeptides of the invention is expressed in mammalian cells using a mammalian
expression vector. Examples of mammalian expression vectors include pCDM8 (Seed,
B., (1987) Nature 329:840) and pMT2PC (Kaufman et al. (1987), EMBO J. 6:187-195).
When used in mammalian cells, the expression vector's control functions are often
provided by viral regulatory elements. For example, commonly used promoters are
derived from polyoma, Adenovirus 2, cytomegalovirus and Simian Virus 40.

In another embodiment, the recombinant mammalian expression vector is
capable of directing expression of the nucleic acid preferentially in a particular cell type
(e.g., tissue-specific regulatory elements are used to express the nucleic acid).
Tissue-specific regulatory elements are known in the art. Non-limiting examples of suitable
tissue-specific promoters include the albumin promoter (liver-specific; Pinkert et al.
(1987) Genes Dev. 1:268-277), lymphoid-specific promoters (Calame and Eaton (1988)
Adv. Immunol. 43:235-275), in particular promoters of T cell receptors (Winoto and
Baltimore (1989) EMBO J. 8:729-733) and immunoglobulins (Banerji et al. (1983) Cell
33:729-740; Queen and Baltimore (1983) Cell 33:741-748), neuron-specific promoters
(e.g., the neurofilament promoter; Byrne and Ruddle (1989) Proc. Natl. Acad. Sci. USA
86:5473-5477), pancreas-specific promoters (Edlund et al. (1985) Science 230:912-916),
and mammary gland-specific promoters (e.g., milk whey promoter; U.S. Patent No.
4,873,316 and European Application Publication No. 264,166). Developmentally-regulated
promoters are also encompassed, for example the murine hox promoters
(Kessel and Gruss (1990) Science 249:374-379) and the α-fetoprotein promoter (Campes
and Tilghman (1989) Genes Dev. 3:537-546).

A recombinant expression vector is introduced into a suitable host cell. The
terms "host cell" and "recombinant host cell" are used interchangeably herein. It is
understood that such terms refer not only to the particular subject cell but to the progeny
or potential progeny of such a cell. Because certain modifications may occur in
succeeding generations due to either mutation or environmental influences, such progeny
may not, in fact, be identical to the parent cell, but are still included within the scope of
the term as used herein.

A host cell may be any prokaryotic or eukaryotic cell. For example, novel HCV
polypeptides may be expressed in bacterial cells such as E. coli, insect cells, yeast or
mammalian cells (such as Chinese hamster ovary cells (CHO) or COS cells). Other
suitable host cells are known to those skilled in the art.

Vector DNA can be introduced into prokaryotic or eukaryotic cells via
conventional transformation or transfection techniques. As used herein, the terms
"transformation" and "transfection" are intended to refer to a variety of art-recognized
techniques for introducing foreign nucleic acid (e.g., DNA) into a host cell, including
calcium phosphate or calcium chloride co-precipitation, DEAE-dextran-mediated
transfection, lipofection, or electroporation. Suitable methods for transforming or
transfecting host cells can be found in Sambrook et al. (Molecular Cloning: A
Laboratory Manual, 2nd Edition, Cold Spring Harbor Laboratory Press (1989)), and
other laboratory manuals.

For stable transfection of mammalian cells, it is known that, depending upon the
expression vector and transfection technique used, only a small fraction of cells may
integrate the foreign DNA into their genome. In order to identify and select these
integrants, a gene that encodes a selectable marker (e.g., resistance to antibiotics) is
generally introduced into the host cells along with the gene of interest. Preferred
selectable markers include those which confer resistance to drugs, such as G418,
hygromycin and methotrexate. Nucleic acid encoding a selectable marker may be
introduced into a host cell on the same vector as that encoding the polypeptide or may be
introduced on a separate vector. Cells stably transfected with the introduced nucleic acid
can be identified by drug selection (e.g., cells that have incorporated the selectable
marker gene will survive, while the other cells die).

A host cell of the invention, such as a prokaryotic or eukaryotic host cell in
culture, can be used to produce (i.e., express) novel HCV polypeptides. Accordingly,
the invention further provides methods for producing novel HCV polypeptides using
these host cells. In one embodiment, the method comprises culturing the host cell of
the invention (into which a recombinant expression vector encoding novel HCV
polypeptides has been introduced) in a suitable medium until a novel HCV polypeptide
is produced. In another embodiment, the method further comprises isolating novel HCV
polypeptides from the medium or the host cell.

C Chemical Synthesis of Novel HCV Polypeptides

The novel HCV polypeptides can be chemically synthesized as is well known in
the art. Moreover, the peptide can be substituted and/or derivatized to optimize stability.
The subject polypeptides can also be synthesized as branched polypeptides, particularly
for vaccine applications as is known in the art (see, e.g., Peptides. Edited by Bernd Gutte.
Academic Press 1995. pp. 456-493).

III Antibodies Which React With Novel HCV Polypeptides

In yet another aspect, the invention pertains to an antibody which binds to a
novel HCV polypeptide. A novel HCV polypeptide, or fragment thereof, can be used as
an immunogen to generate antibodies that bind such a polypeptide using standard
techniques for polyclonal and monoclonal antibody preparation. The invention provides
numerous antigenic peptide fragments of novel HCV polypeptides for use as
immunogens. Preferably, an antigenic peptide of such a polypeptide comprises at least 8
amino acid residues of the amino acid sequence shown in SEQ ID NO: 2 or the ARF #1
polypeptide or ARF #2 polypeptide consensus sequences. Preferably, the antigenic
peptide comprises at least 10 amino acid residues, more preferably at least 14 amino acid
residues, even more preferably at least 18 amino acid residues. Preferred polypeptides
comprise the ARF #1 consensus sequence LNLKEKP(X1)(X2)TPT(X3) or the ARF #2
consensus sequence AAHRT(X4)SSR(X5)(X6)VR, wherein X1 is N or K, X2 is V or E,
X3 is A or V, X4 is L or S, X5 is A or V, and X6 is A or V.
Other preferred HCV polypeptides comprise or consist of the sequence
LNLKEKPNVTPTAC or AAHRTSSSRAVVRC.
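
The consensus definitions above can be made concrete with a short Python sketch that
enumerates every sequence permitted by each consensus and checks the two preferred
peptides against them. The templates and residue choices are those stated in the text;
the function and variable names, and the helper that lists fragments of at least 8
residues, are illustrative assumptions.

```python
import itertools
import re

# Consensus definitions as stated in the text; "{}" marks a variable position.
ARF1_TEMPLATE = "LNLKEKP{}{}TPT{}"
ARF1_CHOICES = [("N", "K"), ("V", "E"), ("A", "V")]   # X1, X2, X3
ARF2_TEMPLATE = "AAHRT{}SSR{}{}VR"
ARF2_CHOICES = [("L", "S"), ("A", "V"), ("A", "V")]   # X4, X5, X6


def expand(template, choices):
    """List every concrete sequence allowed by a consensus template."""
    return [template.format(*combo) for combo in itertools.product(*choices)]


def to_regex(template, choices):
    """Build a regular expression equivalent to the consensus template."""
    classes = ["[" + "".join(c) + "]" for c in choices]
    return re.compile(template.format(*classes))


def fragments(sequence, min_len=8):
    """All contiguous fragments of at least `min_len` residues (the text's minimum is 8)."""
    n = len(sequence)
    return [sequence[i:j] for i in range(n) for j in range(i + min_len, n + 1)]


ARF1_VARIANTS = expand(ARF1_TEMPLATE, ARF1_CHOICES)
ARF2_VARIANTS = expand(ARF2_TEMPLATE, ARF2_CHOICES)
ARF1_RE = to_regex(ARF1_TEMPLATE, ARF1_CHOICES)
ARF2_RE = to_regex(ARF2_TEMPLATE, ARF2_CHOICES)

for peptide in ("LNLKEKPNVTPTAC", "AAHRTSSSRAVVRC"):
    print(peptide, bool(ARF1_RE.search(peptide)), bool(ARF2_RE.search(peptide)))
```

Each consensus has three two-way positions and therefore expands to eight sequences;
the two preferred 14-residue peptides each correspond to one of those variants followed
by a C-terminal cysteine.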

The subject HCV polypeptides are used to prepare antibodies by immunizing a
suitable subject (e.g., rabbit, goat, mouse or other mammal) with the immunogen. An
appropriate immunogenic preparation can contain, for example, recombinantly expressed
novel HCV polypeptides or a chemically synthesized novel HCV polypeptide.
The preparation can further include an adjuvant, such as Freund's complete or
incomplete adjuvant, or similar immunostimulatory agent. Immunization of a suitable
subject with an immunogenic HCV polypeptide preparation induces a polyclonal
anti-HCV polypeptide antibody response.

Accordingly, another aspect of the invention pertains to antibodies which react
with the novel HCV polypeptides. The term "antibody" as used herein refers to
immunoglobulin molecules and immunologically active portions of immunoglobulin
molecules, i.e., molecules that contain an antigen binding site which specifically binds
(immunoreacts with) an HCV polypeptide. The invention provides polyclonal and
monoclonal antibodies that bind HCV polypeptides. The term "monoclonal antibody" or
"monoclonal antibody composition", as used herein, refers to a population of antibody
molecules that contain only one species of an antigen binding site capable of
immunoreacting with a particular epitope of a novel HCV polypeptide. A monoclonal
antibody composition thus typically displays a single binding affinity for a particular
HCV polypeptide with which it reacts.

Polyclonal anti-HCV polypeptide antibodies can be prepared as described above
by immunizing a suitable subject with an HCV polypeptide immunogen or attenuated
HCV virus, or can be obtained from an infected individual. The anti-HCV polypeptide
antibody titer in the immunized subject can be monitored over time by standard
techniques, such as with an enzyme linked immunosorbent assay (ELISA) using
immobilized HCV polypeptide. If desired, the antibody molecules directed against HCV
polypeptide can be isolated from the animal (e.g., from the blood) and further purified by
well known techniques, such as protein A chromatography to obtain the IgG fraction. At
an appropriate time after immunization, e.g., when the antibody titers are highest,
antibody-producing cells can be obtained from the subject and used to prepare
monoclonal antibodies by standard techniques, such as the hybridoma technique
originally described by Kohler and Milstein (1975, Nature 256:495-497) (see also,
Brown et al. (1981) J. Immunol. 127:539-46; Brown et al. (1980) J. Biol. Chem. 255:4980-83;
Yeh et al. (1976) PNAS 76:2927-31; and Yeh et al. (1982) Int. J. Cancer 29:269-75),
the more recent human B cell hybridoma technique (Kozbor et al. (1983) Immunol.
Today 4:72), the EBV-hybridoma technique (Cole et al. (1985), Monoclonal Antibodies
and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96) or trioma techniques. The
technology for producing monoclonal antibody hybridomas is well known (see generally
R. H. Kenneth, in Monoclonal Antibodies: A New Dimension In Biological Analyses,
Plenum Publishing Corp., New York, New York (1980); E. A. Lerner (1981) Yale J.
Biol. Med., 54:387-402; M. L. Gefter et al. (1977) Somatic Cell Genet., 3:231-36).
Briefly, an immortal cell line (typically a myeloma) is fused to lymphocytes (typically
splenocytes) from a mammal immunized with an HCV polypeptide immunogen as
described above, and the culture supernatants of the resulting hybridoma cells are
screened to identify a hybridoma producing a monoclonal antibody that binds the HCV
polypeptide.

Any of the many well known protocols used for fusing lymphocytes and
immortalized cell lines can be applied for the purpose of generating an anti-HCV
polypeptide monoclonal antibody (see, e.g., G. Galfre et al. (1977) Nature 266:550-52;
Gefter et al. Somatic Cell Genet., cited supra; Lerner, Yale J. Biol. Med., cited supra;
Kenneth, Monoclonal Antibodies, cited supra). Moreover, the ordinary skilled worker
will appreciate that there are many variations of such methods which also would be
useful. Typically, the immortal cell line (e.g., a myeloma cell line) is derived from the
same mammalian species as the lymphocytes. For example, murine hybridomas can be
made by fusing lymphocytes from a mouse immunized with an immunogenic
preparation of the present invention with an immortalized mouse cell line. Preferred
immortal cell lines are mouse myeloma cell lines that are sensitive to culture medium
containing hypoxanthine, aminopterin and thymidine ("HAT medium"). Any of a
number of myeloma cell lines may be used as a fusion partner according to standard
techniques, e.g., the P3-NS1/1-Ag4-1, P3-x63-Ag8.653 or Sp2/0-Ag14 myeloma lines.
These myeloma lines are available from the American Type Culture Collection (ATCC),
Rockville, Md. Typically, HAT-sensitive mouse myeloma cells are fused to mouse
splenocytes using polyethylene glycol ("PEG"). Hybridoma cells resulting from the
fusion are then selected using HAT medium, which kills unfused and unproductively
fused myeloma cells (unfused splenocytes die after several days because they are not
transformed).

Hybridoma cells producing a monoclonal antibody of the invention are detected
by screening the hybridoma culture supernatants for antibodies that bind HCV
polypeptides, e.g., using a standard ELISA assay.

Alternative to preparing monoclonal antibody-secreting hybridomas, a
monoclonal anti-HCV polypeptide antibody can be identified and isolated by screening a
recombinant combinatorial immunoglobulin library (e.g., an antibody phage display
library) with HCV polypeptides to thereby isolate immunoglobulin library members that
bind HCV polypeptides. Kits for generating and screening phage display libraries are
commercially available (e.g., the Pharmacia Recombinant Phage Antibody System,
Catalog No. 27-9400-01; and the Stratagene SurfZAP™ Phage Display Kit, Catalog
No. 240612). Additionally, examples of methods and reagents particularly amenable for
use in generating and screening antibody display libraries can be found in, for example,
Ladner et al. U.S. Patent No. 5,223,409; Kang et al. International Publication No. WO
92/18619; Dower et al. International Publication No. WO 91/17271; Winter et al.
International Publication WO 92/20791; Markland et al. International Publication No.
WO 92/15679; Breitling et al. International Publication WO 93/01288; McCafferty et al.
International Publication No. WO 92/01047; Garrard et al. International Publication No.
WO 92/09690; Ladner et al. International Publication No. WO 90/02809; Fuchs et al.
(1991) Bio/Technology 9:1370-1372; Hay et al. (1992) Hum. Antibod. Hybridomas 3:81-85;
Huse et al. (1989) Science 246:1275-1281; Griffiths et al. (1993) EMBO J. 12:725-734;
Hawkins et al. (1992) J. Mol. Biol. 226:889-896; Clarkson et al. (1991) Nature
352:624-628; Gram et al. (1992) PNAS 89:3576-3580; Garrard et al. (1991)
Bio/Technology 9:1373-1377; Hoogenboom et al. (1991) Nuc. Acid Res. 19:4133-4137;
Barbas et al. (1991) PNAS 88:7978-7982; and McCafferty et al. Nature (1990) 348:552-554.

Additionally, recombinant anti-HCV polypeptide antibodies, such as chimeric
and humanized monoclonal antibodies, comprising both human and non-human
portions, which can be made using standard recombinant DNA techniques, are within
the scope of the invention. Such chimeric and humanized monoclonal antibodies can be
produced by recombinant DNA techniques known in the art, for example using methods
described in Robinson et al. International Patent Publication PCT/US86/02269; Akira, et
al. European Patent Application 184,187; Taniguchi, M., European Patent Application
171,496; Morrison et al. European Patent Application 173,494; Neuberger et al. PCT
Application WO 86/01533; Cabilly et al. U.S. Patent No. 4,816,567; Cabilly et al.
European Patent Application 125,023; Better et al. (1988) Science 240:1041-1043; Liu
et al. (1987) PNAS 84:3439-3443; Liu et al. (1987) J. Immunol. 139:3521-3526; Sun et
al. (1987) PNAS 84:214-218; Nishimura et al. (1987) Canc. Res. 47:999-1005; Wood et
al. (1985) Nature 314:446-449; Shaw et al. (1988) J. Natl. Cancer Inst. 80:1553-1559;
Morrison, S. L. (1985) Science 229:1202-1207; Oi et al. (1986) BioTechniques
4:214; Winter U.S. Patent 5,225,539; Jones et al. (1986) Nature 321:552-525;
Verhoeyan et al. (1988) Science 239:1534; and Beidler et al. (1988) J. Immunol.
141:4053-4060.

An anti-HCV polypeptide antibody (e.g., monoclonal antibody) can be used to
isolate or detect HCV polypeptides by standard techniques, such as affinity
chromatography, immunoprecipitation, ELISA, or RIA as is well known in the art. An
anti-HCV polypeptide antibody can facilitate the purification of natural HCV
polypeptide from cells and of recombinantly produced HCV polypeptides expressed in
host cells. Moreover, an anti-HCV polypeptide antibody can be used to detect HCV
polypeptides from a body fluid of a subject which is suspected to have an HCV
infection. Detection may be facilitated by coupling (i.e., physically linking) the antibody
to a detectable substance. Examples of detectable substances include various enzymes,
prosthetic groups, fluorescent materials, luminescent materials and radioactive materials.
Examples of suitable enzymes include horseradish peroxidase, alkaline phosphatase,
β-galactosidase, or acetylcholinesterase; examples of suitable prosthetic group complexes
include streptavidin/biotin and avidin/biotin; examples of suitable fluorescent materials
include umbelliferone, fluorescein, fluorescein isothiocyanate, rhodamine,
dichlorotriazinylamine fluorescein, dansyl chloride or phycoerythrin; an example of a
luminescent material includes luminol; and examples of suitable radioactive material
include 125I, 131I, 35S or 3H.

Such antibodies can be incorporated in diagnostic kits and are also useful in
passive immunization against HCV in patients who have an active HCV infection or
are likely to be exposed to HCV.

IV Uses of Novel HCV Polypeptides 

In another aspect, the invention pertains to a vaccine composition which is
administered to a subject prior to exposure to HCV to prevent hepatitis C infection in
the subject. In one embodiment, the vaccine comprises a novel HCV polypeptide of the
invention. In another embodiment, the vaccine causes a novel HCV polypeptide of the
invention to be synthesized in a subject.

A Vaccines

Novel HCV polypeptide sequences appropriate for use in vaccine compositions
for the prevention of HCV in a subject can easily be determined. For example, epitopes
which elicit an immune response can be identified by screening in an immunoassay
against sera from patients with a past or ongoing HCV infection. Alternatively,
immunogenic polypeptides can be identified by computer analysis to identify
immunogenic epitopes. Finally, the full-length novel polypeptide could be used in a
vaccine.

In another embodiment, agents which are known adjuvants can be administered
with the subject polypeptides. At this time, the only adjuvant widely used in humans
has been alum (aluminum phosphate or aluminum hydroxide). Saponin and its purified
component Quil A, Freund's complete adjuvant and other adjuvants used in research
and veterinary applications have potential use in human vaccines. However, new
chemically defined preparations such as muramyl dipeptide, monophosphoryl lipid A,
phospholipid conjugates such as those described by Goodman-Snitkoff et al. J. Immunol.
147:410-415 (1991), resorcinols, non-ionic surfactants such as polyoxyethylene oleyl
ether and n-hexadecyl polyethylene ether, and enzyme inhibitors including pancreatic
trypsin inhibitor, diisopropylfluorophosphate (DEP) and trasylol, can also be used. In
embodiments in which antigen is administered, the antigen can, e.g., be encapsulated
within a proteoliposome as described by Miller et al., J. Exp. Med. 176:1739-1744
(1992) and incorporated by reference herein, or in lipid vesicles, such as Novasome™
lipid vesicles (Micro Vescular Systems, Inc., Nashua, N.H.), to further enhance immune
responses.

In yet other embodiments, as an alternative to administering the novel HCV
polypeptide, the polypeptide can be synthesized by the subject. This can be done using a
plasmid DNA construct which is similar to those used for delivery of reporter or
therapeutic genes. Such a construct preferably comprises a bacterial origin of replication
that allows amplification of large quantities of the plasmid DNA; a prokaryotic
selectable marker gene; a nucleic acid sequence encoding a novel HCV polypeptide or
portion thereof; eukaryotic transcription regulatory elements to direct gene expression in
the host cell; and a polyadenylation sequence to ensure appropriate termination of the
expressed mRNA (Davis. 1997. Curr. Opin. Biotechnol. 8:635). Vectors used for DNA
immunization may optionally comprise a signal sequence (Michel et al. 1995. Proc.
Natl. Acad. Sci. USA 92:5307; Donnelly et al. 1996. J. Infect. Dis. 173:314). DNA
vaccines can be administered by a variety of means, for example, by injection (e.g.,
intramuscular, intradermal, or the biolistic injection of DNA-coated gold particles into
the epidermis with a gene gun that uses a particle accelerator or a compressed gas to
inject the particles into the skin (Haynes et al. 1996. J. Biotechnol. 44:37)).
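
The elements of such a plasmid construct can be summarized as a simple checklist. The
following Python sketch, offered only as an illustration, records the five element
categories listed above plus the optional signal sequence and reports which required
categories are still unassigned. The class name, field names and part labels are
hypothetical and do not describe any particular vector.

```python
from dataclasses import dataclass, field

# Element categories the text lists for a DNA-immunization construct.
REQUIRED_ELEMENTS = (
    "bacterial_origin",
    "prokaryotic_selectable_marker",
    "antigen_coding_sequence",
    "eukaryotic_regulatory_elements",
    "polyadenylation_sequence",
)


@dataclass
class PlasmidConstruct:
    """Hypothetical record of the parts of a DNA vaccine plasmid."""
    name: str
    elements: dict = field(default_factory=dict)   # category -> part label
    signal_sequence: str = ""                      # optional, per the text

    def missing_elements(self):
        """Return the required categories that have no part assigned."""
        return [e for e in REQUIRED_ELEMENTS if not self.elements.get(e)]

    def is_complete(self):
        """True when every required category has a part assigned."""
        return not self.missing_elements()


# Part labels below are placeholders chosen for illustration only.
draft = PlasmidConstruct(
    name="example-construct",
    elements={
        "bacterial_origin": "high-copy origin",
        "prokaryotic_selectable_marker": "antibiotic resistance gene",
        "antigen_coding_sequence": "novel HCV polypeptide ORF",
        "eukaryotic_regulatory_elements": "viral promoter/enhancer",
    },
)
print(draft.missing_elements())
```

Here the draft lacks a polyadenylation sequence, so the check reports that category
until a part is assigned to it.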

Alternatively, DNA vaccines can be administered by non-invasive means. For example,
pure or lipid-formulated DNA can be delivered to the respiratory system or targeted
elsewhere, e.g., Peyer's patches by oral delivery of DNA (Schubbert. 1997. Proc. Natl.
Acad. Sci. USA 94:961). Attenuated microorganisms can be used for delivery to
mucosal surfaces (Sizemore et al. 1995. Science 270:29).

Any of the instant vaccine compositions can comprise (or encode) one or more
epitopes (either contiguous or non-contiguous) of a novel HCV polypeptide. Such
preparations can further comprise polypeptide sequences derived from an HCV
polyprotein sequence. In other embodiments, such a vaccine composition can further
comprise a compound which will enhance the immunological reactivity of the novel
HCV polypeptide epitope. For example, the immunogenicity of the novel HCV
polypeptides may be enhanced by making a fusion protein comprising a novel HCV
polypeptide fused to a different polypeptide, i.e., not a novel HCV polypeptide.
Techniques for making such fusion proteins are known in the art. Alternatively, a
vaccine can comprise an immunoregulatory molecule, such as a cytokine. For example,
in one embodiment, plasmids for DNA vaccination can express a single immunogen, or
two sequences can be coexpressed. In one embodiment, the additional sequences can be
additional immunogens (novel HCV polypeptides or HCV polyprotein polypeptides or
other polypeptides) or can encode modulators of immune responses such as lymphokine
genes or costimulatory molecules (Iwasaki et al. 1997. J. Immunol. 158:4591).

Typically, vaccine compositions of the present invention are prepared as
injectables, either as liquid solutions or suspensions. Solid forms suitable for solution
in, or suspension in, liquid prior to injection may also be prepared. The composition
may also be emulsified, or the polypeptide encapsulated into liposomes. The
polypeptide may be mixed with pharmaceutically acceptable excipients, for example,
water, saline, dextrose, glycerol, ethanol, or the like. The composition may also
comprise minor amounts of, for example, wetting agents, pH buffering agents and/or
adjuvants, such as aluminum hydroxide, N-acetyl-muramyl-L-threonyl-D-isoglutamine
(thr-MDP), N-acetyl-nor-muramyl-L-alanyl-D-isoglutamine (CGP 11637 or nor-MDP),
N-acetylmuramyl-L-alanyl-D-isoglutaminyl-L-alanine-2-(1'-2'-dipalmitoyl-sn-glycero-3-hydroxyphosphoryloxy)ethylamine
(CGP 19835A, or MTP-PE), or bacterial
components.

Such vaccine compositions are generally administered parenterally, by injection,
usually either subcutaneously or intramuscularly. Other formulations may be
administered orally, by inhalation or as suppositories.

The polypeptides may be incorporated into the vaccine in a neutral or salt form.
Pharmaceutically acceptable salts include the acid addition salts (formed with free amino
groups of the polypeptide) which are formed with inorganic acids, such as, for
example, hydrochloric or phosphoric acids, or such organic acids as acetic, oxalic,
tartaric, maleic, or the like. Salts formed with the free carboxyl groups may also be
derived from inorganic bases such as, for example, sodium, potassium, ammonium,
calcium, or ferric hydroxides, and such organic bases as isopropylamine,
trimethylamine, 2-ethylamino ethanol, histidine, procaine, or the like.

The vaccines are administered so as to be compatible with the dosage
formulation, and in such an amount as will be prophylactically and/or therapeutically
effective. The quantity to be administered depends on the subject to be treated, the
capacity of the subject's immune system to mount an immune response to the vaccine,
and the degree of protection desired. The range of 5 µg to 250 µg of antigen per dose,
however, is often appropriate. The vaccine compositions may be given in a single dose
or in multiple doses. Appropriate doses are well within the skill of the art to determine,
and do not constitute undue experimentation.

In still another embodiment, the invention pertains to a method of preventing
HCV in a subject by administering a novel HCV polypeptide to a subject or by causing a
novel HCV polypeptide to be expressed in a subject.

B Diagnostic Kits

In another aspect of the invention, methods for diagnosing HCV infection (e.g.,
either a past or present infection) and diagnostic kits for detecting an
infection with HCV are provided.

In one embodiment, the invention provides a method of diagnosing HCV
infection by detecting the presence or absence of antibodies in the body fluid of a subject
which bind to a novel HCV polypeptide. In one embodiment the method comprises
incubating a test sample under conditions which allow the binding of a novel HCV
polypeptide and an antibody in the test sample of body fluid and detecting the binding of
polypeptide and antibody.

Test samples can be derived from any appropriate body fluid or tissue
preparation, for example, whole blood, plasma, serum, spinal fluid, lymph fluid, tears,
saliva, milk, or liver tissue preparations.

Detection of the binding between a novel HCV polypeptide of the
invention and an antibody can be accomplished using any technique which is known in
the art and can be facilitated using antibodies labelled as described above.

Antibodies which bind to novel HCV polypeptide can be detected using a
number of different screening assays known in the art, such as an enzyme-linked
immunosorbent assay (ELISA), a radioimmunoassay (RIA), or a Western Blot Assay.
Each assay generally detects the presence of protein-antibody complexes of particular
interest by employing a labeled reagent (e.g., an antibody) specific for the complex of
interest. Accordingly, in the present invention, these assays are used to detect novel
HCV polypeptide-antibody complexes formed between immunoglobulins (e.g., human
IgG, IgM and IgA) contained in a biological sample and a novel HCV polypeptide. As
will be described below, these protein-antibody complexes are preferably detected using
an enzyme-linked antibody or antibody fragment (e.g., a monoclonal antibody or
fragment thereof) which recognizes and specifically binds to the polypeptide-antibody
complexes.
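
By way of illustration only, the readout of such an enzyme-linked assay might be
interpreted as in the following Python sketch, which labels a sample reactive when its
absorbance exceeds a cutoff derived from negative-control wells. The cutoff rule (mean
of the negative controls plus three sample standard deviations), the function names and
all absorbance values are assumptions introduced for this example; the specification
does not prescribe a particular cutoff.

```python
from statistics import mean, stdev


def elisa_cutoff(negative_controls, n_sd=3.0):
    """Cutoff = mean + n_sd * sample SD of negative-control absorbances (assumed rule)."""
    if len(negative_controls) < 2:
        raise ValueError("need at least two negative controls")
    return mean(negative_controls) + n_sd * stdev(negative_controls)


def classify(samples, negative_controls, n_sd=3.0):
    """Label each sample 'reactive' or 'non-reactive' against the cutoff."""
    cutoff = elisa_cutoff(negative_controls, n_sd)
    return {sample_id: ("reactive" if od > cutoff else "non-reactive")
            for sample_id, od in samples.items()}


# Made-up absorbance readings, for illustration only.
negatives = [0.08, 0.10, 0.09, 0.11, 0.07]
samples = {"subject-1": 0.95, "subject-2": 0.10}
print(round(elisa_cutoff(negatives), 4))
print(classify(samples, negatives))
```

Other cutoff conventions (for example, a fixed multiple of the negative-control mean)
could be substituted by changing `elisa_cutoff` alone.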

In one embodiment of the method, a sandwich ELISA assay is used.
For example, a novel HCV polypeptide, with or without conjugation to a carrier such as
activated BSA, is immobilized on a plate. A body fluid sample from an individual is
contacted with the novel HCV polypeptide under conditions which allow binding of the
antibodies in the sample to the polypeptides. The sample is then removed, and any
antibody which has bound to the HCV polypeptide is detected by contacting the plate
with a labeled secondary antibody or antibody fragment which binds to an antibody
which might be present in the subject's sample, e.g., an anti-human antibody. The
unbound secondary antibody is removed and the presence of secondary antibody which
remains bound is detected, e.g., using a label as described above. Possible controls for
use in the method include body fluids from uninfected subjects and polypeptides which
are not novel HCV polypeptides. In accordance with the present invention, the presence
of such an antibody is indicative of an infection with HCV.

In another embodiment of the assay the test sample can be tested for the
presence of novel HCV polypeptides using known antibodies. In these embodiments,
antibodies that bind to novel HCV polypeptides are used to detect the presence of novel
HCV polypeptides in the body fluid of a subject or in a cell of a subject. In performing
such an assay, the antibodies which bind to a novel HCV polypeptide are contacted with
a cell or body fluid of a subject under conditions where a novel HCV polypeptide in the
subject's sample can bind to the antibody. Unbound antibody is removed and bound
antibody is detected. Any of the antibodies described above can be used in practicing
this method. Preferred antibodies for use in the methods of this embodiment are highly
specific, including monospecific and, more preferably, monoclonal antibodies or
fragments thereof. In preferred embodiments, such antibodies are labelled. In other
embodiments, such antibodies are detected by employing a secondary antibody which
binds to them and not to the test sample from a subject. The presence of a novel HCV
polypeptide in the subject's sample is indicative of an infection with HCV.

In yet another aspect, the present invention provides an assay kit for
diagnosing HCV infection in a subject. Preferably, the kit contains a solid support (e.g.,
an ELISA plate) capable of adsorbing immunoglobulin (e.g., IgG, IgM and IgA) from a
subject's sample (preferably a human biological sample, such as a body fluid) and a
monoclonal antibody or fragment thereof specific for a novel HCV polypeptide. In
another embodiment, the solid support can be omitted from the kit. In another
embodiment, the kit contains a solid support (e.g., an ELISA plate or a slide) and a
monoclonal antibody or fragment thereof specific for a novel HCV polypeptide. In other
embodiments, the solid support can be omitted. The assay kit can optionally include
instructions, or additional reagents such as a solution for washing unbound proteins from
the solid support, and materials needed for performing a detection assay.



C Targets for Therapeutic Intervention 

The novel HCV polypeptides of the invention are also targets for anti-HCV
therapy. As such the invention provides methods for identifying compounds which
interact with a novel HCV polypeptide and, thus, are likely to interfere with infection.
In one embodiment, the method involves contacting the polypeptide with a compound in
a cell-free system under conditions which allow interaction of the compound with the
polypeptide such that a complex is formed. The complexes of polypeptide and
compound can then be separated from the compounds which do not bind to the HCV
polypeptide, and the compounds which bind to HCV polypeptides can then be isolated and
identified.

Exemplary compounds which can be screened for activity in the subject assays
include, but are not limited to, peptides, nucleic acids, carbohydrates, small organic
molecules, and natural product extract libraries. The term "non-peptidic compound" is
intended to encompass compounds that are comprised, at least in part, of molecular
structures different from naturally-occurring L-amino acid residues linked by natural
peptide bonds. However, "non-peptidic compounds" are intended to include compounds
composed, in whole or in part, of peptidomimetic structures, such as D-amino acids,
non-naturally-occurring L-amino acids, modified peptide backbones and the like, as well
as compounds that are composed, in whole or in part, of molecular structures unrelated
to naturally-occurring L-amino acid residues linked by natural peptide bonds.
"Non-peptidic compounds" also are intended to include natural products.

A recent trend in medicinal chemistry includes the production of mixtures of
compounds, referred to as libraries. While the use of libraries of peptides is well
established in the art, new techniques have been developed which have allowed the
production of mixtures of other compounds, such as benzodiazepines (Bunin et al. 1992.
J. Am. Chem. Soc. 114:10987; DeWitt et al. 1993. Proc. Natl. Acad. Sci. USA 90:6909),
peptoids (Zuckermann. 1994. J. Med. Chem. 37:2678), oligocarbamates (Cho et al.
1993. Science. 261:1303), and hydantoins (DeWitt et al. supra). Rebek et al. have
described an approach for the synthesis of molecular libraries of small organic molecules
with a diversity of 10^4-10^5 (Carell et al. 1994. Angew. Chem. Int. Ed. Engl. 33:2059;
Carell et al. 1994. Angew. Chem. Int. Ed. Engl. 33:2061).

The compounds of the present invention can be obtained using any of the
numerous approaches in combinatorial library methods known in the art, including:
biological libraries; spatially addressable parallel solid phase or solution phase libraries;
synthetic library methods requiring deconvolution; the 'one-bead one-compound' library
method; and synthetic library methods using affinity chromatography selection. The
biological library approach is limited to peptide libraries, while the other four
approaches are applicable to peptide, non-peptide oligomer or small molecule libraries
of compounds (Lam, K.S. Anticancer Drug Des. 1997, 12:145).

In one embodiment, the test compound is a peptide or peptidomimetic. In
another, preferred embodiment, the compounds are small, organic non-peptidic
compounds.

Other exemplary methods for the synthesis of molecular libraries can be found in
the art, for example in: Erb et al. 1994. Proc. Natl. Acad. Sci. USA 91:11422; Horwell
et al. 1996 Immunopharmacology 33:68; and in Gallop et al. 1994. J. Med. Chem.
37:1233.

Libraries of compounds may be presented in solution (e.g., Houghten (1992)
Biotechniques 13:412-423), or on beads (Lam (1991) Nature 354:82-84), chips (Fodor
(1993) Nature 364:555-556), bacteria (Ladner USP 5,223,409), spores (Ladner USP
'409), plasmids (Cull et al. (1992) Proc Natl Acad Sci USA 89:1865-1869) or on phage
(Scott and Smith (1990) Science 249:386-390); (Devlin (1990) Science 249:404-406);
(Cwirla et al. (1990) Proc. Natl. Acad. Sci. 87:6378-6382); (Felici (1991) J. Mol. Biol.
222:301-310); (Ladner supra.).

In many drug screening programs which test libraries of compounds and
natural extracts, high throughput assays are desirable in order to maximize the number of
compounds surveyed in a given period of time. Assays which are performed in cell-free
systems, such as may be derived with purified or semi-purified proteins, are often
preferred as "primary" screens in that they can be generated to permit rapid development
and relatively easy detection of an alteration in a molecular target which is mediated by a
test compound. Accordingly, in an exemplary screening assay of the present invention,
the compound of interest is contacted with a novel HCV polypeptide. Detection and
quantification of novel HCV polypeptide/compound complexes identifies the compound
as a potential modulator of a novel HCV polypeptide. The efficacy of the compound can
be assessed by generating dose response curves from data obtained using various
concentrations of the test compound. Moreover, a control, e.g., using a different
polypeptide can also be performed to provide a baseline for comparison.

Complex formation between the novel HCV polypeptide and a compound
may be detected by a variety of techniques. For instance, modulation of the formation of
complexes can be quantitated using, for example, detectably labelled proteins such as
radiolabeled, fluorescently labelled, or enzymatically labelled novel HCV polypeptides,
by immunoassay, or by chromatographic detection.
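
The dose response assessment mentioned for the exemplary screening assay can be illustrated with a short calculation. The following is a minimal sketch and not part of the original disclosure: the concentrations, signal values, and function names are hypothetical, the baseline is taken from a control polypeptide measured at the same concentrations, and the half-maximal concentration is estimated by simple log-linear interpolation rather than by a fitted curve.

```python
import math

def baseline_corrected(signals, control_signals):
    """Subtract the control-polypeptide signal measured at each concentration."""
    return [s - c for s, c in zip(signals, control_signals)]

def half_maximal_concentration(concs, responses):
    """Estimate the concentration giving half of the maximal response.

    Interpolates linearly in log10(concentration) between the two measured
    points that bracket the half-maximal response. Assumes concs is sorted
    in ascending order and that the response rises with concentration.
    """
    half = max(responses) / 2.0
    points = list(zip(concs, responses))
    for (c_lo, r_lo), (c_hi, r_hi) in zip(points, points[1:]):
        if r_lo <= half <= r_hi and r_hi > r_lo:
            frac = (half - r_lo) / (r_hi - r_lo)
            log_c = math.log10(c_lo) + frac * (math.log10(c_hi) - math.log10(c_lo))
            return 10 ** log_c
    raise ValueError("half-maximal response is not bracketed by the data")

# Hypothetical data: concentrations in micromolar, signals in arbitrary units.
concs = [0.01, 0.1, 1.0, 10.0, 100.0]
signals = [0.15, 0.30, 0.70, 1.05, 1.10]
control = [0.10, 0.10, 0.10, 0.10, 0.10]

responses = baseline_corrected(signals, control)
ec50 = half_maximal_concentration(concs, responses)
print(f"estimated half-maximal concentration: {ec50:.3f} uM")
```

With real data, a fitted sigmoidal model would normally replace the interpolation; the sketch only shows how the baseline control and the concentration series enter the comparison.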

Typically, it will be desirable to immobilize either the novel
HCV polypeptide or the compound or both to facilitate separation of compound/novel
HCV complexes from uncomplexed forms, as well as to accommodate automation of the
assay. In one embodiment, a fusion protein can be provided which adds a domain that
allows the polypeptide to be bound to a matrix. For example,
glutathione-S-transferase/receptor (GST/receptor) fusion protein forms of the novel
polypeptides can be adsorbed onto glutathione sepharose beads (Sigma Chemical, St. Louis,
MO) or glutathione derivatized microtitre plates, which are then combined with the test
compound which is bound to beads and incubated under conditions conducive to
complex formation, e.g., at physiological conditions for salt and pH, though slightly
more stringent conditions may be desired, e.g., at 4°C in a buffer containing 0.6 M NaCl
or a detergent such as 0.1% Triton X-100. Following incubation, the uncomplexed
forms are removed by washing and compounds which bind to novel HCV polypeptides
are identified.

Other techniques for immobilizing proteins on matrices are also available
for use in the subject assay. For instance, the novel HCV polypeptides can be
immobilized utilizing conjugation of biotin and streptavidin. For instance, biotinylated
novel HCV polypeptides can be prepared from biotin-NHS (N-hydroxy-succinimide)
using techniques well known in the art (e.g., biotinylation kit, Pierce Chemicals,
Rockford, IL), and immobilized in the wells of streptavidin-coated 96 well plates (Pierce
Chemical). Alternatively, antibodies reactive with the novel HCV polypeptides can be
derivatized to the wells of the plate, and the novel HCV polypeptides trapped in the
wells by antibody conjugation. As above, preparations of a novel HCV polypeptide and
a test compound are incubated in the wells of the plate, and the amount of novel HCV
polypeptide/compound complex trapped in the well can be quantitated.

Other exemplary methods for detecting such complexes, in addition to
those described above for the GST-immobilized complexes, include immunodetection of
complexes using antibodies reactive with the novel HCV polypeptide, or which are
reactive with the receptor protein and compete for binding with the novel HCV
polypeptide; as well as enzyme-linked assays which rely on detecting an enzymatic
activity associated with the novel HCV polypeptide. In the instance of the latter, the
enzyme can be chemically conjugated or provided as a fusion protein with the novel
HCV polypeptide. To illustrate, the novel HCV polypeptide can be chemically
cross-linked or genetically fused with alkaline phosphatase, and the amount of novel HCV
polypeptide trapped in the complex can be assessed with a chromogenic substrate of the
enzyme, e.g., paranitrophenylphosphate. Likewise, a fusion protein comprising the novel
HCV polypeptide and glutathione-S-transferase can be provided, and complex formation
quantitated by detecting the GST activity using 1-chloro-2,4-dinitrobenzene (Habig et al.
(1974) J Biol Chem 249:7130).

For processes which rely on immunodetection for quantitating one of the proteins
trapped in the complex, antibodies against the protein, such as the anti-novel HCV
antibodies described herein, can be used. Alternatively, the protein to be detected in the
complex can be "epitope tagged" in the form of a fusion protein which includes, in
addition to the novel HCV polypeptide, a second polypeptide for which antibodies are
readily available (e.g., from commercial sources). For instance, the GST fusion proteins
described above can also be used for quantification of binding using antibodies against
the GST moiety. Other useful epitope tags include myc-epitopes (e.g., see Ellison et al.
(1991) J Biol Chem 266:21150-21157) which includes a 10-residue sequence from
c-myc, as well as the pFLAG system (International Biotechnologies, Inc.) or the
pEZZ-protein A system (Pharmacia, NJ).

The practice of the present invention will employ, unless otherwise indicated,
conventional techniques of cell biology, cell culture, molecular biology, microbiology,
recombinant DNA, and immunology, which are within the skill of the art. Such
techniques are explained fully in the literature. See, for example, Genetics; Molecular
Cloning: A Laboratory Manual, 2nd Ed., ed. by Sambrook, J. et al. (Cold Spring Harbor
Laboratory Press (1989)); Short Protocols in Molecular Biology, 3rd Ed., ed. by
Ausubel, F. et al. (Wiley, NY (1995)); DNA Cloning, Volumes I and II (D. N. Glover
ed., 1985); Oligonucleotide Synthesis (M. J. Gait ed. (1984)); Mullis et al. U.S. Patent
No: 4,683,195; Nucleic Acid Hybridization (B. D. Hames & S. J. Higgins eds. (1984));
the treatise, Methods In Enzymology (Academic Press, Inc., N.Y.); Immunochemical
Methods In Cell And Molecular Biology (Mayer and Walker, eds., Academic Press,
London (1987)); Handbook Of Experimental Immunology, Volumes I-IV (D. M. Weir
and C. C. Blackwell, eds. (1986)); and Miller, J. Experiments in Molecular Genetics
(Cold Spring Harbor Press, Cold Spring Harbor, N.Y. (1972)).

The contents of all references, pending patent applications and published patents,
cited throughout this application are hereby expressly incorporated by reference.
Specifically, the contents of USSN 60/088,670, titled Novel Hepatitis C Virus Peptides
And Uses Thereof, filed on June 9, 1998 and 60/089,138, titled Novel Hepatitis C Virus
Peptides And Uses Thereof, filed on June 11, 1998 are incorporated herein by this
reference.

The invention is further illustrated by the following examples, which should not
be construed as further limiting.

Examples

Example 1. The detection of antibodies against novel HCV polypeptides.

Consensus polypeptides were synthesized based on the sequence homology
between novel HCV polypeptides shown in Table 1. The following peptides were made
using conventional techniques by Biosynthesis Corp. (Lewisville, TX):
LNLKEKPNVTPTAC and AAHRTSSSRAVVRC (alternate reading frame (ARF)
polypeptides 1 and 2, respectively). The following polypeptides derived from HCV
CORE protein were used as controls: PTDPRRRSRNLGKVIDTC and
GCATRKTSERSQPRGRRAPI. The peptides were conjugated to activated bovine
serum albumin giving up to 4 peptide molecules for each BSA molecule. Peptide-BSA
conjugates at a concentration of approximately 0.5 mg/ml were shipped on ice in 3 mls
of phosphate buffered saline, 0.1 M, pH 7.4. They were aliquoted and stored at -20°C
until use.
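
The relationship between ARF polypeptide 1 and SEQ ID NO:1 can be checked with a short translation. The sketch below is illustrative only and not part of the original disclosure: it translates the first 60 nucleotides of SEQ ID NO:1, in the frame in which the first nucleotide is the first nucleotide of a codon, using the standard genetic code, and then searches the result for the first consensus pattern recited in the claims. The names `translate`, `SEQ1_START`, and `ARF1_PATTERN` are assumptions introduced here.

```python
import re

# Standard genetic code, built from the conventional t/c/a/g ordering.
BASES = "tcag"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(dna, frame=0):
    """Translate a DNA string from a 0-based offset, ignoring spaces."""
    seq = dna.replace(" ", "").lower()[frame:]
    codons = (seq[i:i + 3] for i in range(0, len(seq) - 2, 3))
    return "".join(CODON_TABLE[codon] for codon in codons)

# First 60 nucleotides of SEQ ID NO:1, copied from the sequence listing.
SEQ1_START = "gcacgaatcc taaacctcaa agaaaaacca aacgtaacac caaccgtcgc ccacaggacg"

# Consensus pattern LNLKEKP(X1)(X2)TPT(X3): X1 = N or K, X2 = V or E, X3 = A or V.
ARF1_PATTERN = re.compile(r"LNLKEKP[NK][VE]TPT[AV]")

protein = translate(SEQ1_START)
match = ARF1_PATTERN.search(protein)
print(protein)
print(match.group(0) if match else "no match")
```

The 20 residues obtained this way agree with the first 20 residues of SEQ ID NO:2, and the synthesized peptide LNLKEKPNVTPTAC begins with a 13-residue stretch that fits the same pattern.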

ELISAs were performed as described by Kirkegaard and Perry Laboratories, Inc.
(Gaithersburg, MD). For use in the ELISA, the polypeptides were dissolved in 1X
"coating buffer" to give a peptide-BSA concentration of 1 µg/ml; 100 µl were added to
96 well microtiter plates (Falcon 3072, Becton Dickinson and Company, Franklin Lakes,
NJ) and incubated at room temperature for 1 hour or overnight at 4°C. The coating
solution was removed and plates were then blocked by adding 300 µl of 1% BSA in
phosphate buffered saline (as prepared by Kirkegaard and Perry), and incubating for 30
min. at room temperature. The blocking solution was removed and 100 µl of 1% BSA
(1X) was added.

Sera were obtained from patients who were known to have or to have previously
had an HCV infection. Serum samples were diluted 1:100 in 1X BSA and 100 µl was
added to the first well of each (yielding a total of 200 µl). Serum samples were then
serially diluted (two-fold at each step). Control wells contained BSA only. Plates were
incubated for 1 hour at room temperature with moderate agitation to allow binding.
Plates were washed 5 times with 1X wash solution (PBS and 0.02% Tween). After
washing, 100 µl of the secondary antibody was immediately added and allowed to react
for 1 hour at room temperature. The secondary antibody was either the Fab fragment of
anti-human IgG conjugated to horseradish peroxidase (HRP) at a dilution of 1:1000, or
anti-human IgG (in PBS with 1% BSA). Plates were washed 5 times with 1X washing
solution, and then 100 µl of hydrogen peroxide and TMB were added and allowed to
react for 10 to 30 minutes at room temperature. To stop the reaction, 100 µl of 1M
phosphoric acid were added. O.D. measurements were obtained by using a dual
wavelength scanner: 450 nm values minus 650 nm values.
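
The dilution scheme and the dual wavelength reading can be written out as simple arithmetic. The following is a minimal sketch under stated assumptions and not part of the original protocol: it assumes that the 100 µl of 1:100 serum is mixed with the 100 µl of 1% BSA already in the first well (giving 1:200 there), that each subsequent well is a two-fold dilution of the previous one, and that eight wells are used per sample. The well count, the example readings, and the function names are hypothetical.

```python
def well_dilutions(initial_dilution=100, first_well_factor=2, step=2, n_wells=8):
    """Return the reciprocal serum dilution in each well of the series."""
    dilutions = []
    current = initial_dilution * first_well_factor  # 1:100 serum into an equal volume
    for _ in range(n_wells):
        dilutions.append(current)
        current *= step  # two-fold dilution at each step
    return dilutions

def corrected_od(od_450, od_650):
    """Dual wavelength reading: 450 nm value minus 650 nm reference value."""
    return od_450 - od_650

dilutions = well_dilutions()
print(["1:%d" % d for d in dilutions])

# Hypothetical raw readings for a single well.
print(round(corrected_od(1.250, 0.050), 3))
```

Changing `n_wells` or `step` adapts the series to a different plate layout.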

Results of ELISA Tests for Antibodies of HCV-Specific Polypeptides

             Control Sera                  HCV Patient Sera
             Mean OD (S.D.)                Mean OD (S.D.)
             (N=2, tested in duplicate)    (N=6, tested in duplicate)

CORE #1      0.379 (0.063)                 0.851 (0.231)
CORE #2      0.775 (0.053)                 1.390 (0.400)
ARF #1       0.763 (0.133)                 2.286 (0.215)
ARF #2       0.401 (0.039)                 0.813 (0.813)

Example 2. Development of a western blot assay for detecting antibody production to
alternate HCV reading frame proteins.

Five micrograms of BSA-alternate reading frame peptide conjugates and
appropriate controls (quenched BSA, BSA-BSA peptide conjugates) were denatured
by incubating in standard Laemmli loading buffer (with beta-mercaptoethanol
and SDS), at 100°C for three minutes. Samples were then cooled and spun in a
microcentrifuge prior to loading on a discontinuous 1.5 mm thick SDS-PAGE gel
(4% stacking gel [pH 6.8] and 12.5% [pH 8.4] resolving gel). The running
buffer was TRIS-glycine/SDS (0.1%), with an unadjusted pH of approximately 8.4.
Samples were run through the stacking gel (1 cm) for approximately 45 mins at
90 volts. After the bromophenol blue (BB) entered the resolving gel, the voltage was
increased to 160 V. The gels were run for approximately 1.5 hrs, or until the
BB was 2/3 of the way through the gel. After stopping the run, the gels were
either stained with coomassie blue, or equilibrated for 15 minutes with the
transfer buffer (1X CAPS, pH 11.0, 10% MeOH). PVDF membranes (Immobilon-P
[0.45 micron pore size]) were wet in 100% MeOH for 1 min, then 50:50
MeOH:double distilled water for 5 mins, and finally equilibrated with the transfer buffer.
The transfer was set up in transfer buffer (filter paper, gel, PVDF
membrane, and filter paper), and slipped into the BIO-RAD transfer tank. The
transfer was run at 20 volts, overnight (approximately 10 hrs), at 4°C, with stirring.
After transfer, the tank was disassembled, the membrane was soaked in 100%
MeOH for one minute, then allowed to air-dry. In order to visualize the
efficiency of the transfer, the membranes (after re-wetting in MeOH then
water) were incubated with 1% Ponceau-S red stain, and rinsed in double distilled water.
After scanning the image, the dye was removed in a dilute NaOH solution (1 ml
saturated NaOH in 100 mls ddwater).
The membranes were then blocked in 3% NFDM (6 grams non-fat dried milk in
200 mls of 1X TBS [TRIS-buffered saline, pH 7.4]), for one hour. After
blocking, the membranes were rinsed in two washes of 1X TBS, 200 mls apiece.

The membranes were then incubated with the primary antibody solution: 4 mls of
a 1/200 dilution of patient sera in 1% NFDM in 1X TTBS (TBS with 0.025%
Tween-20). This incubation lasted one hour, at 30°C, in glass with slow rotation of the
tube or beaker. After this incubation, the membranes were washed three times
in 200 mls of 1X TTBS.

The secondary antibody solution was 200 mls of a 1/3000 dilution of the
BIO-RAD goat anti-human alkaline phosphatase conjugate, in 1% NFDM in 1X
TTBS. This incubation lasted 1 hr, at 30°C, with gentle shaking.

After this incubation, the membranes were washed twice with 200 mls of
1X TTBS (5 mins apiece), and once with 200 mls of 1X TBS.

The bands were visualized with the BIO-RAD AP substrate kit (200 mls total),
with gentle, occasional shaking. After visualization, the membranes were
washed several times with ddwater, followed by one wash with MeOH, then
air-dried and photographed or scanned.

What is claimed is:

1. An isolated or recombinant polypeptide or fragment thereof encoded by a nucleic
acid molecule derived from a hepatitis C virus, having at least one of the following
characteristics:

1) at least a portion of the polypeptide is encoded by a reading frame +1 or +2
relative to the standard hepatitis C virus open reading frame;

2) at least a portion of the polypeptide is encoded by a reading frame
corresponding to the reading frame of SEQ ID NO:1 in which the first nucleotide of
SEQ ID NO:1 is the first nucleotide of a codon;

3) at least a portion of the polypeptide comprises an amino acid sequence at least
60% identical to the amino acid sequence shown in SEQ ID NO:2; and

4) at least a portion of the polypeptide comprises an amino acid sequence
encoded by a nucleic acid molecule which hybridizes under high stringency to the
nucleotide sequence shown in SEQ ID NO:1.

2. The polypeptide or portion thereof of claim 1, wherein said polypeptide is at least
about 8 amino acids to at least about 100 amino acids in length.

3. The polypeptide or portion thereof of claim 2, wherein said polypeptide is at least
about 14 amino acids to at least about 30 amino acids in length.

4. The polypeptide or portion thereof of claim 1, wherein said polypeptide is
encoded by a reading frame +1 or +2 to the standard hepatitis C reading frame.

5. The polypeptide or portion thereof of claim 1, wherein said polypeptide is
encoded by a reading frame corresponding to the reading frame of SEQ ID NO:1 in
which the first nucleotide of SEQ ID NO:1 is the first nucleotide of a codon.

6. The polypeptide or portion thereof of claim 5, wherein said polypeptide or
portion thereof is encoded by the nucleic acid molecule of SEQ ID NO:1 and causes an
immune response in a subject.

7. The polypeptide or portion thereof of claim 1, wherein said polypeptide
comprises an amino acid sequence at least 60% identical to the amino acid sequence
shown in SEQ ID NO:2 and causes an immune response in a subject.

8. The polypeptide or portion thereof of claim 1, wherein said polypeptide
comprises an amino acid sequence at least 90% identical to the amino acid sequence
shown in SEQ ID NO:2 and causes an immune response in a subject.

9. The polypeptide or portion thereof of claim 1, wherein said polypeptide
comprises an amino acid sequence shown in SEQ ID NO:2 which polypeptide causes an
immune response in a subject.

10. The polypeptide or portion thereof of claim 1, wherein said polypeptide
comprises an amino acid sequence encoded by a nucleic acid molecule which hybridizes
under high stringency to the nucleotide sequence shown in SEQ ID NO:1.

11. The polypeptide or portion thereof of claim 1 which polypeptide comprises at
least a portion of an amino acid sequence selected from the group consisting of SEQ ID
NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, and SEQ ID NO:8
and causes an immune response in a subject.

12. An isolated or recombinant polypeptide comprising an amino acid sequence
selected from the group consisting of: LNLKEKP(X1)(X2)TPT(X3) and
AAHRT(X4)SSR(X5)(X6)VR, wherein X1 is N or K, X2 is V or E, X3 is A or V, X4 is
L or S, X5 is A or V, and X6 is A or V.

WO 99/63941    PCT/US99/12929


13. A polypeptide consisting of an amino acid sequence selected from the group
consisting of LNLKEKPNVTPTAC and AAHRTSSSRAVVRC.

14. A vaccine composition for preventing hepatitis C infection in a subject
comprising the polypeptide of claim 1.

15. A vaccine composition for preventing hepatitis C infection in a subject
comprising the polypeptide of claim 2.

16. A vaccine composition for preventing hepatitis C infection in a subject
comprising the polypeptide of claim 4.

17. A vaccine composition for preventing hepatitis C infection in a subject
comprising the polypeptide of claim 7.

18. A vaccine composition for preventing hepatitis C infection in a subject
comprising the polypeptide of claim 12.

19. A vaccine composition for preventing hepatitis C infection in a subject
comprising a nucleic acid encoding polypeptide of claim 1.

20. A vaccine composition for preventing hepatitis C infection in a subject
comprising a nucleic acid encoding polypeptide of claim 2.

21. A vaccine composition for preventing hepatitis C infection in a subject
comprising a nucleic acid encoding polypeptide of claim 4.

22. A vaccine composition for preventing hepatitis C infection in a subject
comprising a nucleic acid encoding polypeptide of claim 7.

23. A vaccine composition for preventing hepatitis C infection in a subject
comprising a nucleic acid encoding polypeptide of claim 12.

24. An antibody which binds to a polypeptide of claim 1.

25. A kit for detecting a hepatitis C infection comprising the polypeptide of claim 1.

26. A kit for detecting a hepatitis C infection comprising an antibody to the
polypeptide of claim 1.

27. A method of preventing HCV infection comprising administering the
polypeptide of claim 1 to a subject or by causing said polypeptide to be synthesized in a
subject prior to HCV infection such that HCV infection is prevented.

28. A method of diagnosing HCV infection comprising detecting the presence or
absence of antibodies which react with the polypeptide of claim 1 in the body fluid of a
subject, wherein the presence of antibodies which bind the polypeptide is indicative of
an infection with HCV.

29. A method of diagnosing HCV infection comprising detecting the presence or
absence of the polypeptide of claim 1 in the body fluid or tissue of a subject, wherein the
presence of an HCV polypeptide is indicative of an infection with HCV.

30. A method for identifying a compound which interacts with the polypeptide of
claim 1, comprising:

contacting said polypeptide with a compound in a cell-free system under
conditions which allow interaction of the compound with the polypeptide such that a
complex is formed;

separating the compounds which do not form complexes with an HCV
polypeptide from those which do form complexes with an HCV polypeptide; and

isolating and identifying the compounds which form complexes with an HCV
polypeptide.

SEQUENCE LISTING

<110> Andrea Branch, Jose Walewski, and Dechard Stump

<120> NOVEL HEPATITIS C VIRUS PEPTIDES AND USES THEREOF

<130> RII-003PC

<140>
<141>

<160> 3

<170> PatentIn Ver. 2.0

<210> 1
<211> 595
<212> DNA
<213> Hepatitis C virus

<400> 1
gcacgaatcc taaacctcaa agaaaaacca aacgtaacac caaccgtcgc ccacaggacg 60
tcaagttccc gggtggcggt cagatcgttg gtggagttta cttgttgccg cgcaggggcc 120
ctagattggg tgtgcgcgcg acgaggaaga cttccgagcg gtcgcaacct cgaggtagac 180
gtcagcctat ccccaaggca cgtcggcccg agggcaggac ctgggctcag cccgggtacc 240
cttggcccct ctatggcaat gagggttgcg ggtgggcggg atggctcctg tctccccgtg 300
gctctcggcc tagctggggc cccacagacc cccggcgtag gtcgcgcaat ttgggtaagg 360
tcatcgatac ccttacgtgc ggcttcgccg acctcatggg gtacataccg ctcgtcggcg 420
cccctcttgg aggcgctgcc agggccctgg cgcatggcgt ccgggttctg gaagacggcg 480
tgaactatgc aacagggaac cttcctggtt gctctttctc tatcttcctt ctggccctgc 540
tctcttgcct gactgtgccc gcttcagcct accaagtgcg caattcctcg gggct 595

<210> 2
<211> 196
<212> PRT
<213> Hepatitis C virus

<400> 2

Ala Arg Ile Leu Asn Leu Lys Glu Lys Pro Asn Val Thr Pro Thr Val
 1               5                   10                  15

Ala His Arg Thr Ser Ser Ser Arg Val Ala Val Arg Ser Leu Val Glu
             20                  25                  30

Phe Thr Cys Cys Arg Ala Gly Ala Leu Asp Trp Val Cys Ala Arg Arg
         35                  40                  45

Gly Arg Leu Pro Ser Gly Arg Asn Leu Glu Val Asp Val Ser Leu Ser
     50                  55                  60

Pro Arg His Val Gly Pro Arg Ala Gly Pro Gly Leu Ser Pro Gly Thr
 65                  70                  75                  80

Leu Gly Pro Ser Met Ala Met Arg Val Ala Gly Gly Arg Asp Gly Ser
                 85                  90                  95

Cys Leu Pro Val Ala Leu Gly Leu Ala Gly Ala Pro Gln Thr Pro Gly
            100                 105                 110

Val Gly Arg Ala Ile Trp Val Arg Ser Ser Ile Pro Leu Arg Ala Ala
        115                 120                 125

Ser Pro Thr Ser Trp Gly Thr Tyr Arg Ser Ser Ala Pro Leu Leu Glu
    130                 135                 140

Ala Ala Pro Gly Pro Trp Arg Met Ala Ser Gly Phe Trp Lys Thr Ala
145                 150                 155                 160

Thr Met Gln Gln Gly Thr Phe Leu Val Ala Leu Ser Leu Ser Ser Phe
                165                 170                 175

Trp Pro Cys Ser Leu Ala Leu Cys Pro Leu Gln Pro Thr Lys Cys Ala
            180                 185                 190

Ile Pro Arg Gly
        195



<210> 3
<211> 13
<212> PRT
<213> Hepatitis C virus

<220>
<223> at location 8, X is N or K

<220>
<223> at location 9, X is V or E

<220>
<223> at location 13, X is A or V

<400> 3
Leu Asn Leu Lys Glu Lys Pro Xaa Xaa
 1               5

<210> 4
<211> 13
<212> PRT
<213> Hepatitis C virus

<220>
<223> at location 6, X is L or S

<220>
<223> at location 10, X is A or V

<220>
<223> at location 11, X is A or V

<400> 4
Ala Ala His Arg Thr Xaa Ser Ser Arg Xaa Xaa Val Arg
 1               5                   10

<210> 5
<211> 14
<212> PRT
<213> Hepatitis C virus

<400> 5
Leu Asn Leu Lys Glu Lys Pro Asn Val Thr Pro Thr Ala Cys
 1               5                   10

<210> 6
<211> 14
<212> PRT
<213> Hepatitis C virus

<400> 6
Ala Ala His Arg Thr Ser Ser Ser Arg Ala Val Val Arg Cys
 1               5                   10

<210> 7
<211> 18
<212> PRT
<213> Hepatitis C virus

<400> 7
Pro Thr Asp Pro Arg Arg Arg Ser Arg Asn Leu Gly Lys Val Ile Asp
 1               5                   10                  15

Thr Cys

<210> 8
<211> 20
<212> PRT
<213> Hepatitis C virus

<400> 8
Gly Cys Ala Thr Arg Lys Thr Ser Glu Arg Ser Gln Pro Arg Gly Arg
 1               5                   10                  15

Arg Ala Pro Ile
             20

Full name of sole or first inventor
Andrea D. BRANCH

Inventor's signature

Date

Residence
923 5th Avenue, Apartment 6A, New York, New York 10021, United States of America

Citizenship
United States of America

Post Office Address (if different)
same as above



Full name of sole or first inventor
Jose L. WALEWSKI

Residence
95 Tuckahoe Avenue, Eastchester, New York 10709, United States of America

Citizenship
United States of America

Post Office Address (if different)
same as above



Full name of sole or first inventor
Dechard D. STUMP

Inventor's signature

Date

Residence
529 East 83rd Street, Apartment 5R, New York, New York 10028, United States of America

Citizenship
United States of America

Post Office Address (if different)
same as above



